Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error in making result file #274

Open
Rhia15 opened this issue Jun 24, 2023 · 3 comments
Open

Error in making result file #274

Rhia15 opened this issue Jun 24, 2023 · 3 comments

Comments

@Rhia15
Copy link

Rhia15 commented Jun 24, 2023

Hi,

I have been trying to use find_enrichment.py to perform an enrichment analysis. I originally tried it with my study but it did not work so I tried it with the data provided on the github page as well as the command line example but seem to receive the same error message. It doesnt produce a results file either. Is there something else which i need to do before?

(enrichment) [mbxrs14@login001(Augusta) enrichment]$ find_enrichment.py small_study.txt small_population.txt small_association.txt --pval=0.05 --method=fdr_bh --pval_field=fdr_bh --outfile=results_id2gos.xlsx
go-basic.obo: fmt(1.2) rel(2023-06-11) 46,420 Terms
**WARNING: GO:0000229 NOT FOUND IN DAG
**WARNING: GO:0004871 NOT FOUND IN DAG
**WARNING: GO:0007568 NOT FOUND IN DAG
**WARNING: GO:0008565 NOT FOUND IN DAG
**WARNING: GO:0051186 NOT FOUND IN DAG
HMS:0:00:00.040034 6,267 annotations READ: small_association.txt
Study: 38 vs. Population 2000

WARNING: only 0.39473684210526316 fraction of genes/proteins in study are found in the population background.

ERROR: only 0.39473684210526316 of genes/proteins in the study are found in the background population. Please check.

Any help would be greatly appreciated thank you!

@tanghaibao
Copy link
Owner

@Rhia15

Can you check to see if most of the genes in your study (38 genes) is a subset of the population (2000 genes)?
The population can be the entire protein set in the genome of interest.

@Rhia15
Copy link
Author

Rhia15 commented Jun 24, 2023

Hi, thank you for the reply! the input in the example is the txt files I downloaded from this github page in test/data to check if there was anything wrong with the environment or any dependencies. I think it is a subset of the population. It looks like I get the the same warnings about DAG with any data I try though. Could there be something wrong with the DAG? but I have checked the permissions for go-basic.obo and if the IDs are present and everything seems okay, so I am not sure what to do :/

@tanghaibao
Copy link
Owner

@Rhia15

I just tested with the test files in this repo, it seems to run fine. Have you checked if you have the latest version?

$ find_enrichment.py data/study.txt data/population.txt data/association.txt --pval=0.05 --method=fdr_bh --pval_field=fdr_bh --outfile=results_id2go.xlsx

**WARNING: GO:0000059 NOT FOUND IN DAG
**WARNING: GO:0000060 NOT FOUND IN DAG
...
HMS:0:00:00.263267 127,967 annotations READ: data/association.txt
Study: 276 vs. Population 33239
Load BP Ontology Enrichment Analysis ...
Propagating term counts up: is_a
 87% 28,948 of 33,239 population items found in association

Load CC Ontology Enrichment Analysis ...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants