Investigate Yeastract and SGD APIs for regulatory information #949

Closed
dondi opened this issue Jan 11, 2022 · 7 comments

@dondi
Owner

dondi commented Jan 11, 2022

SGD's API may be YeastMine. We want to look for regulation information there, if it is available.

If no API is available, we may need to develop a mechanism for storing the GRNs in our own database and populating it from published files.
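As a rough illustration of the fallback described above, here is a minimal sketch that stores regulator-to-target edges in a local database and loads them from a published tab-separated file. Everything in it is an assumption made for illustration: the `regulation` table, its columns, the two-column file layout, and the use of SQLite are hypothetical and not something decided in this issue. The gene names are placeholders, not real data.

```python
import csv
import io
import sqlite3


def create_schema(conn):
    # Hypothetical schema: one row per regulator -> target edge, tagged with its source file.
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS regulation (
            regulator TEXT NOT NULL,
            target    TEXT NOT NULL,
            source    TEXT NOT NULL,
            PRIMARY KEY (regulator, target, source)
        )
        """
    )


def import_edges(conn, tsv_file, source):
    # Assumes each line of the published file is "regulator<TAB>target".
    reader = csv.reader(tsv_file, delimiter="\t")
    rows = [(r[0].strip(), r[1].strip(), source) for r in reader if len(r) >= 2]
    # INSERT OR IGNORE makes re-importing the same file harmless.
    conn.executemany("INSERT OR IGNORE INTO regulation VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


def targets_of(conn, regulator):
    cur = conn.execute(
        "SELECT DISTINCT target FROM regulation WHERE regulator = ? ORDER BY target",
        (regulator,),
    )
    return [row[0] for row in cur]


# Placeholder edges standing in for a published file.
conn = sqlite3.connect(":memory:")
create_schema(conn)
published = io.StringIO("REGA\tGENEX\nREGA\tGENEY\nREGB\tGENEY\n")
import_edges(conn, published, source="example-file")
print(targets_of(conn, "REGA"))
```

Keeping a `source` column would let edges from different published files sit side by side, which may matter if Yeastract and SGD disagree.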

@Onariaginosa
Collaborator

During this weekly session, I went to YeastMine and selected the regulators tab. From there I was able to get all of the regulator genes and search for the targets of each regulator. YeastMine provided code to do this, so I am currently creating a script to generate all of the regulator and target genes.

During the meeting, @kdahlquist mentioned that I should generate two adjacency matrices: a 295x295 regulator-by-regulator matrix, which is primarily used in GRNmap, and a 295x???? regulator-by-target matrix.
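For reference, a minimal sketch of how the two matrices could be built from a list of (regulator, target) pairs. This is not the script from this issue; the function name `build_matrices`, the orientation (rows are regulators, columns are regulators or targets), and the placeholder gene names are all assumptions.

```python
def build_matrices(edges):
    """Build 0/1 adjacency matrices from (regulator, target) pairs.

    Assumed orientation: rows are regulators; columns are regulators
    (first matrix) or every gene that appears as a target (second matrix).
    """
    edges = list(edges)
    edge_set = set(edges)
    regulators = sorted({reg for reg, _ in edges})
    targets = sorted({tgt for _, tgt in edges})

    reg_by_reg = [
        [1 if (row, col) in edge_set else 0 for col in regulators]
        for row in regulators
    ]
    reg_by_target = [
        [1 if (row, col) in edge_set else 0 for col in targets]
        for row in regulators
    ]
    return regulators, targets, reg_by_reg, reg_by_target


# Placeholder edges, not real regulatory data.
edges = [("REGA", "REGB"), ("REGA", "GENEX"), ("REGB", "REGB")]
regulators, targets, reg_by_reg, reg_by_target = build_matrices(edges)
print(regulators, targets)
print(reg_by_reg)
print(reg_by_target)
```

With N regulators and M distinct targets, the first matrix is N x N and the second is N x M.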

@kdahlquist
Collaborator

200x6000 regulator X target

I'm not sure how many genes SGD will classify as regulators; it will be roughly 175-250.

@Onariaginosa
Collaborator

I created the script to generate the regulator-by-regulator adjacency matrix as well as the regulator-by-target adjacency matrix. There were 294 genes classified as regulators, and I generated all of their targets. Some genes did not have a standard name, so the script currently leaves them out of the matrices.

@kdahlquist
Collaborator

When there is no standard name, we need to substitute the systematic name instead.
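A minimal sketch of that fallback, assuming each gene record carries a possibly empty standard name and a systematic name. The function name `display_name` is hypothetical, and the inputs below are illustrative rather than pulled from SGD.

```python
def display_name(standard_name, systematic_name):
    """Return the standard name, or the systematic name when none is set."""
    if standard_name is not None and standard_name.strip():
        return standard_name.strip()
    return systematic_name


# Illustrative inputs: one gene with a standard name, two without.
print(display_name("ACE2", "YLR131C"))
print(display_name(None, "YLR278C"))
print(display_name("", "YLR278C"))
```

Treating an empty string the same as a missing value covers query results that return blanks instead of nulls.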

@Onariaginosa
Collaborator

I made the appropriate fix and regenerated the adjacency matrices.

@dondi
Owner Author

dondi commented Feb 15, 2022

We decided at the meeting to commit this work to a new top-level folder called database. The .gitignore will also be revised to avoid accidentally committing files produced by these scripts.

@Onariaginosa
Collaborator

This is complete.


3 participants