InterMine is an open source data warehouse system for the integration and analysis of complex biological data, developed for the last 10 years by the Micklem Lab at the University of Cambridge. InterMine has been used for developing data warehousing solutions for a number of projects, including for storage and analysis of modENCODE data, and as a data mining platform for a number of major model organism databases as part of the InterMOD project.
InterMine has been developed with the support of the Wellcome Trust , , , as well as support from the National Human Genome Research Institute [R01HG004834]. The Wellcome Trust also recently granted a further 5 years of funding for InterMine development, as well as development of HumanMine, a data warehouse of human genetic, genomic and proteomic data, ensuring continued development of InterMine as a framework.
The publicly available InterMine instances include:
- FlyMine - a data warehouse of integrated fruit fly genetic, genomic and proteomic data
- YeastMine - an integrated data warehouse of yeast genomic data, developed by SGD
- RatMine - an integrated data warehouse of rat genomic data, developed by RGD
- MouseMine - an integrated data warehouse of mouse genomic data, developed by MGI
- TargetMine - a data warehouse for candidate gene prioritisation and drug target discovery, developed at NIBIO, Japan
- Zebrafishmine - an integrated data warehouse of zebrafish genomic data, developed by ZFIN
- Thalemine - a data warehouse for Arabidopsis thaliana Col-0 for the ARAPORT project
- PhytoMine - an integrated data warehouse of over 50 plant genomes from Phytozome.
See the InterMine registry for the full list of InterMine instances.
All InterMine code is freely available under the open source LGPL license.