monarch-mapping-commons

This repository contains the source code which generates the SSSOM-style mapping files used for the Monarch Initiative knowledge graph.
The pipeline is run via Jenkins and the resulting mapping files are uploaded to Google Cloud Storage, hosted at https://data.monarchinitiative.org/mappings/

Repository Structure

config/ - configuration files
- project-cruft.json - edit this if you need to change any of the project template values in .cruft.json
mappings/ - SSSOM mapping files (do not edit these)
scripts/ - scripts for processing mappings (ok to edit these as needed)
src/monarch_gene_mapping - source code for Monarch Gene Mapping (ok to edit these as needed). See monarch_gene_mapping/README.md for more information.
Makefile - Default makefile generated by the cookiecutter (Do not edit this file)
monarch_mapping_commons.Makefile - Custom makefile with additional targets specific to this project (ok to edit this file)

Usage

Prerequisites

Installation

git clone https://github.com/monarch-initiative/monarch-mapping-commons.git
cd monarch-mapping-commons
poetry install

Running

make mappings

Note:
The first time you run this command, it will take a while to download and process the data.
Subsequent runs will be much faster.
This is because Monarch Gene Mapping depends on a very large (11gb) file from UniProtKB.
Future plans are in place to cache this file in Google Cloud Storage, or to use the UniProtKB API,
but for now, the file must be downloaded in its entirety.

Developer Documentation

To update the mapping registry from OLS:

sh odk.sh make update_registry -B

To update the mappings:

sh odk.sh make mappings

If the run requires a recently published SSSOM or OAK feature, first update ODK:

docker pull obolibrary/odkfull:dev

and then run the dependencies goal together with the mappings goal:

IMAGE=odkfull:dev sh odk.sh make mappings

For Windows, append :dev to obolibrary/odkfull in the odk.bat file.

Note: If running on a Windows machine, replace sh odk.sh with odk.bat in the above commands.

Design decisions:

Only mappings of base entities are extracted. This ensures that we do not import the same UBERON mapping for every species specific anatomy ontology (XAO). This is realised as a filtering step that relies on the crude assumption that the ontology ID is somehow reflected in the subject_id.

Credits

This project was made with the mapping-commons-cookiecutter.

Name		Name	Last commit message	Last commit date
Latest commit History 311 Commits
.github		.github
config		config
mappings		mappings
metadata		metadata
scripts		scripts
src/monarch_gene_mapping		src/monarch_gene_mapping
tests		tests
.cruft.json		.cruft.json
.env		.env
.gitignore		.gitignore
Jenkinsfile		Jenkinsfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
monarch_mapping_commons.Makefile		monarch_mapping_commons.Makefile
odk.bat		odk.bat
odk.sh		odk.sh
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
registry.yml		registry.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

monarch-mapping-commons

Repository Structure

Usage

Prerequisites

Installation

Running

Developer Documentation

Design decisions:

Credits

About

Releases 3

Packages

Contributors 9

Languages

License

monarch-initiative/monarch-mapping-commons

Folders and files

Latest commit

History

Repository files navigation

monarch-mapping-commons

Repository Structure

Usage

Prerequisites

Installation

Running

Developer Documentation

Design decisions:

Credits

About

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 9

Languages

Packages