BioDBtBioDBtBioDBt
  • Molecular Biology
  • NGS technologies
  • Advanced rDNA
  • Bioinformatics
  • Tools
Notification Show More
BioDBtBioDBt
  • Molecular Biology
  • NGS technologies
  • Advanced rDNA
  • Bioinformatics
  • Tools
Have an existing account? Sign In
Follow US
© 2024 BioDBt
Home » Bioinformatics » Types of BLAST, applications, Algorithm & interpretation of results » Page 2

Types of BLAST, applications, Algorithm & interpretation of results

Beaven
Last updated: 23/10/24
By Beaven - Senior Editor Bioinformatics
Share
15 Min Read
This post may be undergoing an editorial review to improve its content. Updates or revisions may occur to enhance accuracy, clarity, and completeness.
SHARE
Section
2 - Introduction to BLAST
1 - Types of BLAST
2 - Introduction to BLAST
3 - Installation of Standalone BLAST on Windows PC

Types of BLAST

  1. Nucleotide BLAST (BLASTn): compares a nucleotide query (DNA/RNA) against a nucleotide sequence database. Used for searching gene transcripts, primers, SNPs, and other short DNA sequences.
  2. protein BLAST (BLASTp): compares a protein query against a protein sequence database. Used for finding similar proteins, examining functional domains, and exploring protein evolution.
  3. BLASTx: compares a nucleotide query translated in all reading frames against a protein sequence database. Used when a nucleotide sequence is suspected to code for a protein.
  4. tBLASTn: compares a protein query against a translated nucleotide database in all reading frames. Used for finding genes in unfinished genomes based on protein homologies.
  5. tBLASTx: translates the query nucleotide sequence in all reading frames and compares against the translated nucleotide database in all reading frames. Used for finding similarities among genes that may have frameshift sequencing errors.
  6. Protein-protein BLAST (blastp): This program, given a protein query, returns the most similar protein sequences from the protein database that the user specifies.
  7. Position-Specific Iterative BLAST (PSI- BLAST) (blastpgp): This program is used to find distant relatives of a protein.
  8. Large numbers of query sequences (megablast): When comparing large numbers of input sequences via the command-line BLAST, “megablast” is much faster than running BLAST multiple times.

Applications of the different BLAST types

  1. BLASTn – Used for finding homologous gene transcripts, detecting gene families, identifying promoters, confirming cDNA clones, and designing probes/primers.
  2. BLASTp – Widely used for protein function prediction, domain identification, inferring orthologs/paralogs, and constructing phylogenetic trees. Also used to identify protein toxins, allergens, or antimicrobial peptides.
  3. BLASTx – Useful when only a DNA sequence is available and protein homology needs to be checked. Allows putative protein translation products to be compared.
  4. tBLASTn – Used to predict genes in unfinished genomes and for chromosome walking. Allows translated nucleotide databases to be searched using a protein query.
  5. tBLASTx – Helpful for analyzing EST libraries, viral genomes, metagenomes, and comparisons between closely related species. Translation in all frames accommodates sequencing errors.
  6. PSI-BLAST – Specialized version of BLASTp for iteratively searching a database using a profile of previously identified sequences. Useful for discovering distant evolutionary relationships.
  7. PHI-BLAST – Searches a database with a pattern specified as a motif or regular expression. Used for identifying new members of a protein family.
  8. RPS-BLAST – Searches protein databases with a profile defined in a CDD (Conserved Domain Database). Used to identify conserved domains in protein sequences.

Algorithm of BLAST

The blast algorithm is fast, accurate, and web-accessible. It is relatively faster than other sequence similarity search tools. Complex BLAST algorithm requires multiple steps and many parameters

The BLAST algorithm works in three main stages:

  1. Seed generation – From the query sequence, BLAST selects small subsequences of fixed length called “words” or “seeds”. These act as starting points for finding sequence matches.
  2. Extension – BLAST extends each seed word match in both directions along the query sequence as far as possible using a defined substitution matrix to score each extension. This generates High-scoring Segment Pairs (HSPs) representing a local alignment between the query and a database sequence.
  3. Evaluation – The HSPs are evaluated based on statistical significance. BLAST uses statistics to calculate the expectation value (E-value) of each alignment score occurring by chance in a database search. Alignments with significant E-values are reported as hits.

Key parameters in BLAST algorithm:

  • Word size – Length of initial exact match seeds
  • Scoring matrix – Rewards/penalties for matches/mismatches
  • E-value threshold – Cut-off for reporting statistically significant hits

BLAST Steps overview

  1. Specifying A Sequence Of Interest
  2. Selecting BLAST Program
  3. Selecting Database
  4. Selecting Optional Parameters Selecting
  5. Formatting Parameter

Interpretation of BLAST Results

Key aspects of analyzing BLAST results:

  • E-value – Lower E-value means more significant alignment. Generally, E-value < 0.001 is a good hit.
  • Bit score – Higher bit score indicates better alignment.
  • Query coverage – What percentage of the query sequence is covered by the alignment. Higher coverage is better.
  • Max identity – The highest identity between the query and the hit, given in percentage. Higher identity indicates closer match.
  • Alignments – Review the aligned sequences between the query and subject sequences for mismatches, gaps, and matched regions.
  • Subject description – The annotation of the database hit provides information on the putative identification and known function.
distance tree of results
Previous Page123Next Page
TAGGED:BLASTSequence Alignment

Sign Up For Daily Newsletter

Our resources that will help you excel in your academics and research.
By Beaven
Senior Editor
Manjengwa, B. is currently pursuing an M.Sc. (Hons) in Biotechnology at Panjab University, Chandigarh, having completed his B.Sc. (Hons) in Biotechnology. His specialized training includes Next Generation Sequencing Technologies: Data Analysis and Applications, Academic Paper Writing and Intellectual Property Rights (IPR), and Digital Marketing and Management Studies.
Leave a Comment Leave a Comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

Check out these ...

testing

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus…

Beaven

Yeast Two-Hybdrid (Y2H) system explained

The Yeast Two-Hybrid (Y2H) system or Yeast Two-Hybrid Assay represents a powerful…

TanviBeaven

Ligase Chain Reaction (LCR) Explained

Ligase chain reaction (LCR) is a thermostable DNA ligase-dependent DNA amplification which…

Beaven Tags: Ligase Chain Reaction (LCR)
BioDBtBioDBt
Follow US
© 2024 BioDBt (Bioinformatics-Driven Biotechnology)
  • Privacy Policy
  • Cookie Policy
  • About us
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up