BioJulia/BioFetch.jl


BioFetch.jl

Easily fetch biological sequences from online sources

BioFetch provides a high-level interface for retrieving records from sequence databases and returning them in a readily manipulable form via FASTX.jl and GenomeAnnotations.jl.

It currently supports the Entrez (NCBI) Nucleotide and Protein databases, as well as UniProt and Ensembl.

Examples:

using BioFetch                                            # load the package that provides fetchseq

fetchseq("AH002844")                                      # retrieve one NCBI nucleotide record as FASTA
fetchseq("CAA41295.1", "NP_000176", format = gb)          # retrieve two NCBI protein records as GenBank Flat File
fetchseq("Q00987")                                        # retrieve one UniProt protein record as FASTA
fetchseq("ENSG00000141510")                               # retrieve one Ensembl gene record's proteins as FASTA
fetchseq("NC_036893.1", 81_775_230 .+ (1:1_000_000))      # retrieve a 1 Mb segment of an NCBI genomic record as FASTA
fetchseq("NC_036893.1", 81000000:81999999, true)          # retrieve a 1 Mb segment of an NCBI genomic record as FASTA, on the reverse strand
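
The two range arguments in the last examples are ordinary Julia ranges, so they can be inspected before any request is sent. The following is a minimal sketch in plain Base Julia that checks which positions each expression spans. It does not call BioFetch or touch the network, and the variable names `offset_range` and `literal_range` are made up for illustration. How BioFetch maps these values onto database coordinates is not shown here.

```julia
# Plain Base Julia: no BioFetch call and no network access happen here.
# `offset_range` and `literal_range` are illustrative names, not part of BioFetch.

# Broadcasting an offset over 1:1_000_000 shifts both endpoints of the range.
offset_range = 81_775_230 .+ (1:1_000_000)

# The same kind of window written as a literal start:stop range.
literal_range = 81_000_000:81_999_999

println(first(offset_range), " to ", last(offset_range))  # prints "81775231 to 82775230"
println(length(offset_range))                             # prints 1000000
println(length(literal_range))                            # prints 1000000
```

Both expressions describe an inclusive window of 1,000,000 positions. The offset form is convenient when the window length is fixed and only the starting point changes.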
