BLAST Analysis
What is it ?
Basic Local Alignment Search Tool .
It finds local similarities between sequences .
By comparing nucleotide or protein sequences against databases and then calculating
statistical values for the probability that they are not related by chance .
Sequencing allows researchers to study any changes in genes associations
,
with diseases
and phenotypes and identify potential drug targets (personalised medicine )
In evolutionary biology ,
DNA sequencing helps studying how different organisms are related
and how they evolved .
DNA sequencing allowed understanding the SARS -
COVID -2 antigenic variation and designing
cutting edge vaccines (
Astrazeneca ,
Pfizer
,
Moderna ) .
As of September 2021 ,
NCBI database contains 288,903,207 records , including 210,703,648
Proteins 40,213,945 RNAs and sequences
from 113,002 organisms .
,
What is FASTA format ?
Text based format
-
for representing nucleotide sequences of peptide sequences using
single -
letter codes .
It begins with a
single line description ,
followed by lines of sequence data .
The description line is distinguished from the sequence data by a greater than
-
G) symbol
in the first column .
Blank lines aren't allowed in the middle of FASTA input .
What is it ?
Basic Local Alignment Search Tool .
It finds local similarities between sequences .
By comparing nucleotide or protein sequences against databases and then calculating
statistical values for the probability that they are not related by chance .
Sequencing allows researchers to study any changes in genes associations
,
with diseases
and phenotypes and identify potential drug targets (personalised medicine )
In evolutionary biology ,
DNA sequencing helps studying how different organisms are related
and how they evolved .
DNA sequencing allowed understanding the SARS -
COVID -2 antigenic variation and designing
cutting edge vaccines (
Astrazeneca ,
Pfizer
,
Moderna ) .
As of September 2021 ,
NCBI database contains 288,903,207 records , including 210,703,648
Proteins 40,213,945 RNAs and sequences
from 113,002 organisms .
,
What is FASTA format ?
Text based format
-
for representing nucleotide sequences of peptide sequences using
single -
letter codes .
It begins with a
single line description ,
followed by lines of sequence data .
The description line is distinguished from the sequence data by a greater than
-
G) symbol
in the first column .
Blank lines aren't allowed in the middle of FASTA input .