Version: Next

FASTA

Types of data loaded#

Features and their sequences. Will create a feature for each entry in a fasta file and set the sequence, the class of the feature to create is set for the whole file.

How to download the data#

N/A - will parse any file in FASTA format

How to load the data into your mine#

Project XML example

<source name="flybase-dmel-gene-fasta" type="flybase-dmel-gene-fasta">
<property name="flybase-dmel-gene-fasta.taxonId" value="7227"/>
<property name="flybase-dmel-gene-fasta.dataSetTitle" value="FlyBase fasta data set for Drosophila melanogaster"/>
<property name="flybase-dmel-gene-fasta.dataSourceName" value="FlyBase"/>
<property name="flybase-dmel-gene-fasta.className" value="org.intermine.model.bio.Gene"/>
<property name="flybase-dmel-gene-fasta.classAttribute" value="primaryIdentifier"/>
<property name="flybase-dmel-gene-fasta.includes" value="dmel-all-gene-*.fasta"/>
<property name="src.data.dir" location="/DATA/flybase/fasta"/>
<!-- add licence here -->
<property name="flybase-dmel-gene-fasta.licence" value="https://creativecommons.org/licenses/by/4.0/"/>
</source>
attributecontentpurpose
taxonIdspace-delimited list of taxonIdsonly features with the listed taxonIds will be loaded
classNamefully-qualified class namedetermines which feature will be loaded
classAttributeidentifier field from classNamedetermines which field from the feature will be set
dataSetTitlename of datasetdetermines name of dataset object
dataSourceNamename of datasourcedetermines name of datasource object
src.data.dirlocation of the fasta data filethese data will be loaded into the database
includesname of data filethis data file will be loaded into the database
sequenceTypeclass nametype of sequence to be loaded
loaderClassNamename of Java file that will process the fasta filesonly use if you have created a custom fasta loader
licenceURL pointing to standard data licence for dataupdates DataSet.licence with value