BioRuby

Bioinformatics

% sudo gem install bio

% su 
# ruby setup.rb

> gem install bio

#!/usr/bin/env ruby
require 'bio'

# create a DNA sequence object from a String
dna = Bio::Sequence::NA.new("atcggtcggctta")

# create an RNA sequence object from a String
rna = Bio::Sequence::NA.new("auugccuacauaggc")

# create a Protein sequence from a String
aa = Bio::Sequence::AA.new("AGFAVENDSA")

# you can check if the sequence contains illegal characters
# that is not an accepted IUB character for that symbol
# (should prepare a Bio::Sequence::AA#illegal_symbols method also)
puts dna.illegal_bases

# translate and concatenate a DNA sequence to Protein sequence
newseq = aa + dna.translate
puts newseq # => "AGFAVENDSAIGRL"

#!/usr/bin/env ruby

# you can use Bio::Sequence object as a String object to print, seamlessly
dna = Bio::Sequence::NA.new("atgc")
puts dna        # => "atgc"
str = dna.to_s
puts str        # => "atgc"

#!/usr/bin/env ruby

require 'bio'

# create a DNA sequence
seq = Bio::Sequence::NA.new("atggccattgaatga")

# translate to protein
prot = seq.translate

# prove that it worked
puts seq   # => "atggccattgaatga"
puts prot  # => "MAIE*"

#!/usr/bin/env ruby

require 'bio'

# make a 'codon'
codon = Bio::Sequence::NA.new("uug")

# you can translate the codon as described in the previous section.
puts codon.translate  # => "L"

#!/usr/bin/env ruby

require 'bio'

# make a 'codon'
codon = Bio::Sequence::NA.new("uug")

# select the standard codon table
codon_table = Bio::CodonTable[1]

# You need to convert RNA codon to DNA alphabets because the
# CodonTable in BioRuby is implemented as a static Hash with keys
# expressed in DNA alphabets (not RNA alphabets).
codon2 = codon.dna

# get the representation of that codon and translate to amino acid.
amino_acid = codon_table[codon2]
puts amino_acid        # => "L"

#!/usr/bin/env ruby

require 'bio'

# Generates a sample 100bp sequence.
seq1 = Bio::Sequence::NA.new("aatgacccgt" * 10)

# Naming this sequence as "testseq" and print in FASTA format
# (folded by 60 chars per line).
puts seq1.to_fasta("testseq", 60)

#!/usr/bin/env ruby

require 'bio'

file = Bio::FastaFormat.open(ARGV.shift)
file.each do |entry|
# do something on each fasta sequence entry
end

#!/usr/bin/env ruby

require 'bio'

Bio::FlatFile.auto(ARGF) do |ff|
ff.each do |entry|
  # do something on each fasta sequence entry
end
end

#!/usr/bin/env ruby

require 'bio'

Bio::FlatFile.open(Bio::FastaFormat, ARGV[0]) do |ff|
  ff.each do |entry|
    # do something on each fasta sequence entry
  end
end


A BioRuby shell on Rails

Stable release	1.5.0 / 1 July 2015 (2015-07-01)

Repository	https://github.com/bioruby/bioruby
Written in	Ruby
Type	Bioinformatics
License	GPL
Website	bioruby.open-bio.org

Class names	Description
Bio::Sequence::NA, Bio::Sequence::AA	Nucleic and amino acid sequences
Bio::Locations, Bio::Features	Locations / Annotations
Bio::Reference, Bio::PubMed	Literatures
Bio::Pathway, Bio::Relation	Graphs
Bio::Alignment	Alignments

Class names	Description
Bio::GenBank, Bio::EMBL	GenBank / EMBL
Bio::SPTR, Bio::NBRF, Bio::PDB	SwissProt and TrEMBL / PIR / PDB
Bio::FANTOM	FANTOM DB (Functional annotation of mouse)
Bio::KEGG	KEGG database parsers
Bio::GO, Bio::GFF	Bio::PROSITE FASTA format / PROSITE motifs
Bio::FastaFormat, Bio::PROSITE	FASTA format / PROSITE motifs

Class names	Description
Bio::Blast, Bio::Fasta, Bio::HMMER	Sequence similarity (BLAST / FASTA / HMMER)
Bio::ClustalW, Bio::MAFFT	Multiple sequence alignment (ClustalW / MAFFT)
Bio::PSORT, Bio::TargetP	Protein subcellular localization (PSORT / TargetP)
Bio::SOSUI, Bio::TMHMM	Transmembrane helix prediction (SOSUI / TMHMM)
Bio::GenScan	Gene finding (GenScan)

Class names	Description
Bio::Registry	OBDA Registry service
Bio::SQL	OBDA BioSQL RDB schema
Bio::Fetch	OBDA BioFetch via HTTP
Bio::FlatFileIndex	OBDA flat file indexing system
OBDA flat file indexing system	Flat file reader with data format autodetection
Bio::DAS	Distributed Annotation System (DAS)
Bio::KEGG::API	SOAP/WSDL intarface for KEGG

BioRuby

History

BioRuby

Version history^[8]

Installation

Installation of BioRuby

Mac OS X/Unix/Linux

Windows

Usage

Basic Syntax^[10]

Basic Sequence Manipulation

String to Bio::Sequence object

Bio::Sequence object to String

Translation

Translating a DNA or RNA Sequence or SymbolList to Protein

Translating a single codon to a single amino acid

Sequence I/O

Writing Sequences in Fasta format

Reading in a Fasta file

Classes and Modules

Major Classes

Basic Data Structure

Databases and sequence file formats

Wrapper and parsers for bioinformatics tool

File, network and database I/O

Biogem

Popular Biogems

Plugins

See also^[14]

BioRuby

Ruby/bioinformatics links

Sister projects

Blogs

References

External links

#	Biogem	Description	Version
1	bio	Bioinformatics Library	1.4.3.0001
2	biodiversity	Parser of scientific names	3.1.5
3	Simple Spreadsheet extractor	Basic spreadsheet content extraction using Apache poi	0.13.3
4	Bio gem	Software generator for Ruby	1.36
5	Bio samtools	Binder of samtools for Ruby	2.1.0
6	t2 server	Support for interacting with the taverna 2 server	1.1.0
7	bio ucsc api	The Ruby ucsc api	0.6.2
8	entrez	http request to entrez e-utilities	0.5.8.1
9	bio gadget	Gadget for bioinformatics	0.4.8
10	sequenceserver	Blast search made easy!	0.8.7

BioRuby

History

BioRuby

Version history[8]

Installation

Installation of BioRuby

Mac OS X/Unix/Linux

Windows

Usage

Basic Syntax[10]

Basic Sequence Manipulation

String to Bio::Sequence object

Bio::Sequence object to String

Translation

Translating a DNA or RNA Sequence or SymbolList to Protein

Translating a single codon to a single amino acid

Sequence I/O

Writing Sequences in Fasta format

Reading in a Fasta file

Classes and Modules

Major Classes

Basic Data Structure

Databases and sequence file formats

Wrapper and parsers for bioinformatics tool

File, network and database I/O

Biogem

Popular Biogems

Plugins

See also[14]

BioRuby

Ruby/bioinformatics links

Sister projects

Blogs

References

External links

Version history^[8]

Basic Syntax^[10]

See also^[14]