Bienvenue sur Scribd !

Ignorer le carrousel

4.1. Pairwise Alignment - 2

Transféré par

Katona imre

0% ont trouvé ce document utile (0 vote)

18 vues4 pages

ede

Titre original

4.1. Pairwise Alignment_2

Copyright

Formats disponibles

PDF, TXT ou lisez en ligne sur Scribd

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Signaler ce document

ede

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

0% ont trouvé ce document utile (0 vote)

18 vues4 pages

4.1. Pairwise Alignment - 2

Transféré par

Katona imre

ede

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

Passer à la page

Vous êtes sur la page 1sur 4

Rechercher à l'intérieur du document

Introduction to Pairwise

Alignment
Sequence alignment is a way of comparing two primary sequences of DNA,
RNA, or protein

In principle:
two sequences are written out, one on top of the other

gacctaatcgtgaccatttgcgcgcttaaaatccgtta
attgacctaaatcgtgaccatgcgcgcttaaaatccgttaaaaa

then one sequence is moved with respect to the other, and gaps are inserted in each
sequence to maximize identical pairs of bases (or similar pairs of amino acids) lining up
on the top of each other.

gaccta-atcgtgaccatttgcgcgcttaaaatccgtta
attgacctaaatcgtgaccat--gcgcgcttaaaatccgttaaaaa

gap

Alignment can be done by hand using any word processor or text editor to
move lines of text and insert spaces.
It is a relatively straightforward computational problem.

Examples:
identical sequences ACACACTA
ACACACTA

A C A C A C T A
A
C
A
C
A
C
T
A

different sequences ACACACTA

AGCACACA A gap has to be inserted

A C A C A C T A
A
G
C A gap has to be inserted
A
C
A
C
A

alignments: A-CACACTA ACACACT-A

AGCACAC-A AG-CACACA

7 identities, 5 identities,
2 gaps 2 gaps,
2 mismatches

more different sequences ATGCGTCGTT (longer)

ATCCGCGAT (shorter)

alignments: X Y
ATGC-GTCGTT ATGCGTCGTT
AT-CCG-CGAT ATCCG-CGAT

7 identities, 7 identities,
3 gaps 1 gap
1 mismatch 2 mismatches
„Evaluation” of alignments
The overall quality of the alignment is evaluated based on formulas that count the number
of identical (or similar) pairs, mismatches and gaps.

In the formulas the mismatches and gaps are penalized.

- gaps are penalized significantly more than mismatches

since an indiscriminate use of gaps can force an alignment between virtually any pair of
sequences – leading to a meaningless result.

- the penalty assigned to a gap is (usually) not proportional to the size of the gap
because, from biological perspective, larger insertions/deletions (indels) are almost as
common as single-base indels.

To „evaluate” the alignments, two approaches can be used:

- distance is measured → distance index
- similarity is measured → similarity index

The two methods give the same results in most cases

Let us consider the distance method in more detail:

D: the distance between the two sequences

D = Min(w1β + w2γ)

Min: minimum value of (w1β + w2γ) among all possible alignments

(α: the number of pairs of matched elements)
β: the number of pairs of mismatched elements
γ: the number of gaps irrespective of gap length
w1: arbitrarily penalty for a mismatch (e.g. 1)
w2: arbitrarily penalty for a gap (e.g. 4)
(w1 is usually smaller than w2 because deletion and insertion occur less
frequently than substitution)

(let’s apply this equation to the above example:)

For X: D = 1x1 + 3x4 = 13
For Y: D = 2x1 + 1x4 = 6

6 is smaller than 13, and thus alignment Y is considered better than X

For many possible alignments we can use the following formula:

Kmax
D = Min(nd + Σw n ) k k
i=1
nd: the number of different (mismatched) elements
wk: the penalty for a gap of k nucleotides
nk: the number of gaps with length k
kmax: the maximum gap length allowed
The distance method is a protocol for finding the alignment with the smallest
D

For the similarity method a similar formula can be used:

Kmax
S = Max(nm - Σw n ) k k
i=1
nm: the number of identical(matched) elements
The similarity method is a protocol of finding an alignment with the
maximum S

Introduction to Bioinformatics
Matthias Sipiczki, Department of Genetics, University of Debrecen
Comments to: lipovy@tigris.unideb.hu

Vous aimerez peut-être aussi

Encoding Information For DNA Computing: Shinnosuke Seki
Document45 pages
Encoding Information For DNA Computing: Shinnosuke Seki
NNMSA
Pas encore d'évaluation
Approximation Algorithms 1
Document56 pages
Approximation Algorithms 1
sandhiyasubramaniyan2001
Pas encore d'évaluation
Section-A: Course Code: 21ODMCH611 UID:D21MCA11642
Document13 pages
Section-A: Course Code: 21ODMCH611 UID:D21MCA11642
Rishi
Pas encore d'évaluation
Design of DNA Origami: Paul W.K. Rothemund
Document8 pages
Design of DNA Origami: Paul W.K. Rothemund
mehdi1902
Pas encore d'évaluation
Design of Non-Binary Quasi-Cyclic LDPC Codes by ACE Optimization
Document6 pages
Design of Non-Binary Quasi-Cyclic LDPC Codes by ACE Optimization
Sinshaw Bekele
Pas encore d'évaluation
Applications of Random Sampling in Computational Geometry, II
Document11 pages
Applications of Random Sampling in Computational Geometry, II
Miguel Alfonso Olfato
Pas encore d'évaluation
PCB Lect02 Pairwise Allign
Document51 pages
PCB Lect02 Pairwise Allign
Livs
Pas encore d'évaluation
Assignment 1
Document5 pages
Assignment 1
mk5514075
Pas encore d'évaluation
DNA and Protein Sequence Alignment Guide
Document96 pages
DNA and Protein Sequence Alignment Guide
黃柏翰
Pas encore d'évaluation
K-Test - 31
Document38 pages
K-Test - 31
Prashin Varshney
Pas encore d'évaluation
(Intuition: Conservation Law
Document17 pages
(Intuition: Conservation Law
Elayavel Raja
Pas encore d'évaluation
8.2 Graphsandconvergence: Discrete Mathematics: Functions On The Set of Natural Numbers
Document9 pages
8.2 Graphsandconvergence: Discrete Mathematics: Functions On The Set of Natural Numbers
Varun Singh
Pas encore d'évaluation
ITW 1998 Low Density Parity Check Codes over Finite Fields
Document2 pages
ITW 1998 Low Density Parity Check Codes over Finite Fields
Luis Camacho Flores
Pas encore d'évaluation
Approx
Document28 pages
Approx
Muthuvarshini S
Pas encore d'évaluation
Pairwise Sequence Alignment: CS 838 WWW - Cs.wisc - Edu/ Craven/cs838.html Mark Craven Craven@biostat - Wisc.edu January 2001
Document18 pages
Pairwise Sequence Alignment: CS 838 WWW - Cs.wisc - Edu/ Craven/cs838.html Mark Craven Craven@biostat - Wisc.edu January 2001
Fadhili Dunga
Pas encore d'évaluation
Pairwise Alignment 2017
Document49 pages
Pairwise Alignment 2017
mn
Pas encore d'évaluation
Quasi-Orthogonal STBC With Minimum Decoding Complexity
Document15 pages
Quasi-Orthogonal STBC With Minimum Decoding Complexity
chessgenius
Pas encore d'évaluation
2019 Evomics Reference Free
Document118 pages
2019 Evomics Reference Free
drcyber
Pas encore d'évaluation
Multivariate Volatility Forecast and Multi Asset Perp Pricing
Document8 pages
Multivariate Volatility Forecast and Multi Asset Perp Pricing
Eric John Juta
Pas encore d'évaluation
Strakos: On The Real Convergence Rate of The Conjugate Gradient Method
Document15 pages
Strakos: On The Real Convergence Rate of The Conjugate Gradient Method
Marco Antonio Zuñiga Perez
Pas encore d'évaluation
Sequence Alignment Methods Final
Document69 pages
Sequence Alignment Methods Final
Dr. Kaushal Kishor Sharma
Pas encore d'évaluation
Proving Lower Bounds Via Pseudo-Random Generators
Document14 pages
Proving Lower Bounds Via Pseudo-Random Generators
Sangat Baik
Pas encore d'évaluation
Dot Plot
Document18 pages
Dot Plot
Usama Ay
Pas encore d'évaluation
Co Kriging PDF
Document4 pages
Co Kriging PDF
Miguel Angel Cañon Ramos
Pas encore d'évaluation
12 Filter algorithms
Document7 pages
12 Filter algorithms
dethleff901
Pas encore d'évaluation
HW 08
Document2 pages
HW 08
misiyak112
Pas encore d'évaluation
CSC 2426 Report
Document3 pages
CSC 2426 Report
nekriachvv
Pas encore d'évaluation
Sequence To Graph Alignment Using Gap-Sensitive Co-Linear Chaining
Document16 pages
Sequence To Graph Alignment Using Gap-Sensitive Co-Linear Chaining
lbqurtfts
Pas encore d'évaluation
Entanglement-Assisted and Subsystem Quantum Codes - New Propagation Rules and Constructions
Document14 pages
Entanglement-Assisted and Subsystem Quantum Codes - New Propagation Rules and Constructions
Marvin Olavides
Pas encore d'évaluation
Block Cipher Modes and Padding Techniques
Document8 pages
Block Cipher Modes and Padding Techniques
Conan
Pas encore d'évaluation
Problem - E - Codeforces
Document2 pages
Problem - E - Codeforces
Hữu Đạt Nguyễn
Pas encore d'évaluation
Safe and Effective Determinant Evaluation: February 25, 1994
Document16 pages
Safe and Effective Determinant Evaluation: February 25, 1994
David Immanuel
Pas encore d'évaluation
Thuat-Toan-Va-Ung-Dung - Pham-Quang-Dung,-Do-Phan-Thuan - Finaltest - (Cuuduongthancong - Com)
Document5 pages
Thuat-Toan-Va-Ung-Dung - Pham-Quang-Dung,-Do-Phan-Thuan - Finaltest - (Cuuduongthancong - Com)
Bao Trung Thai
Pas encore d'évaluation
Li 2020 Dirichlet Graph Variational Autoencoder
Document10 pages
Li 2020 Dirichlet Graph Variational Autoencoder
Andre Lamurias
Pas encore d'évaluation
solved ATP paper workbook
Document82 pages
solved ATP paper workbook
raidah iftikhar
Pas encore d'évaluation
Chemical Reaction Engineering Lecture on Rate Laws and Methods
Document40 pages
Chemical Reaction Engineering Lecture on Rate Laws and Methods
Rohan Pawar
Pas encore d'évaluation
Efficient Maximum Likelihood Decoding of Linear Block Codes Using A Trellis
Document5 pages
Efficient Maximum Likelihood Decoding of Linear Block Codes Using A Trellis
vidisha nitin
Pas encore d'évaluation
On Ternary LCD Codes
Document6 pages
On Ternary LCD Codes
huevonomar05
Pas encore d'évaluation
Is Spacetime a Quantum Error-Correcting Code? Evidence from Holographic Duality and Tensor Networks
Document25 pages
Is Spacetime a Quantum Error-Correcting Code? Evidence from Holographic Duality and Tensor Networks
Nannai02
Pas encore d'évaluation
Networks and Graphs: Understanding Structure and Dynamics
Document77 pages
Networks and Graphs: Understanding Structure and Dynamics
Jessie
Pas encore d'évaluation
Ps 10
Document2 pages
Ps 10
Mano Ranjith Kumar M 007684
Pas encore d'évaluation
Goodcode
Document21 pages
Goodcode
idan kahan
Pas encore d'évaluation
Gr12 Advance Function Ch4
Document50 pages
Gr12 Advance Function Ch4
layandy
Pas encore d'évaluation
Gene Finding and HMMS: 6.096 - Algorithms For Computational Biology - Lecture 7
Document69 pages
Gene Finding and HMMS: 6.096 - Algorithms For Computational Biology - Lecture 7
fvenky
Pas encore d'évaluation
Fulltext PDF
Document10 pages
Fulltext PDF
Lovinf Florin
Pas encore d'évaluation
Assign Pro
Document3 pages
Assign Pro
Rajesh Shukla
Pas encore d'évaluation
Zkproofs
Document4 pages
Zkproofs
Chí Hưng
Pas encore d'évaluation
Chemical Reaction Engineering Rate Laws and Methods
Document40 pages
Chemical Reaction Engineering Rate Laws and Methods
Rohan Pawar
Pas encore d'évaluation
Fast Parallel Algorithms For Testing K-Connectivity of Directed and Undirected Graphs
Document5 pages
Fast Parallel Algorithms For Testing K-Connectivity of Directed and Undirected Graphs
Yakov Abrams
Pas encore d'évaluation
Linear Codes: 3.1 Basics
Document17 pages
Linear Codes: 3.1 Basics
BudianTo Yang
Pas encore d'évaluation
A New Upper Bound and Optimal Constructions of Equi-Difference Conflict-Avoiding Codes On Constant Weight
Document9 pages
A New Upper Bound and Optimal Constructions of Equi-Difference Conflict-Avoiding Codes On Constant Weight
Tudor Micu
Pas encore d'évaluation
03 Questions
Document2 pages
03 Questions
Abdul-Rhman Mohamed
Pas encore d'évaluation
Coulomb's Law and Applications - QP - AJ
Document6 pages
Coulomb's Law and Applications - QP - AJ
Cadet
Pas encore d'évaluation
Lecture 13: Expander Codes
Document4 pages
Lecture 13: Expander Codes
shtompel07
Pas encore d'évaluation
Ps 2
Document2 pages
Ps 2
VIJAYPUTRA
Pas encore d'évaluation
PSAT-ALGEBRA QUESTIONS2
Document50 pages
PSAT-ALGEBRA QUESTIONS2
favourbernard065
Pas encore d'évaluation
hw3 P
Document15 pages
hw3 P
ballechase
50% (2)
Analytical Design of Adc
Document6 pages
Analytical Design of Adc
bganeshsai
Pas encore d'évaluation
A Closed-Form Charge-Based Expression For Drain Current in Symmetric and Asymmetric Double Gate MOSFET
Document4 pages
A Closed-Form Charge-Based Expression For Drain Current in Symmetric and Asymmetric Double Gate MOSFET
Zuona Chen
Pas encore d'évaluation
Discrete Mathematics with Applications
D'Everand
Discrete Mathematics with Applications
Thomas Koshy
Évaluation : 3 sur 5 étoiles
3/5 (9)
Toti
Document17 pages
Toti
Katona imre
Pas encore d'évaluation
Martin Carson J 200505 Ma
Document131 pages
Martin Carson J 200505 Ma
Katona imre
Pas encore d'évaluation
Conventional Blood Banking and Blood Component Storage Regulation: Opportunities For Improvement
Document7 pages
Conventional Blood Banking and Blood Component Storage Regulation: Opportunities For Improvement
Katona imre
Pas encore d'évaluation
PDF Compression, Ocr, Web Optimization Using A Watermarked Evaluation Copy of Cvision Pdfcompressor
Document12 pages
PDF Compression, Ocr, Web Optimization Using A Watermarked Evaluation Copy of Cvision Pdfcompressor
Katona imre
Pas encore d'évaluation
Jclinpath00271 0080
Document1 page
Jclinpath00271 0080
Katona imre
Pas encore d'évaluation
Swift Playgrounds Is A Revolutionary New Ipad App That Helps You Learn and Explore Coding Swift.
Document1 page
Swift Playgrounds Is A Revolutionary New Ipad App That Helps You Learn and Explore Coding Swift.
Katona imre
Pas encore d'évaluation
BLT 06 235
Document3 pages
BLT 06 235
Katona imre
Pas encore d'évaluation
Old Blood, New Blood or Better Stored Blood?: Giancarlo Maria Liumbruno, James P. Aubuchon
Document3 pages
Old Blood, New Blood or Better Stored Blood?: Giancarlo Maria Liumbruno, James P. Aubuchon
Katona imre
Pas encore d'évaluation
Mega Stat
Document6 pages
Mega Stat
Katona imre
Pas encore d'évaluation
BD Multicolor Fluorochrome Specs
Document1 page
BD Multicolor Fluorochrome Specs
Katona imre
Pas encore d'évaluation
Sterilizers For Medical Purposes
Document5 pages
Sterilizers For Medical Purposes
Katona imre
Pas encore d'évaluation
CEN/TC 332 Business Plan Overview
Document5 pages
CEN/TC 332 Business Plan Overview
Katona imre
Pas encore d'évaluation
Alcumus Isoqar Iso 45001 Gap Analysis PDF
Document15 pages
Alcumus Isoqar Iso 45001 Gap Analysis PDF
Bryan
Pas encore d'évaluation
Cheat Sheet
Document2 pages
Cheat Sheet
Cmpt Cmpt
Pas encore d'évaluation
PhD program at TU Wien develops new drug technologies
Document1 page
PhD program at TU Wien develops new drug technologies
Katona imre
Pas encore d'évaluation
Full Circle Magazine - October 2018
Document45 pages
Full Circle Magazine - October 2018
Katona imre
Pas encore d'évaluation
Libre Office
Document8 pages
Libre Office
Katona imre
Pas encore d'évaluation
Pillepalackbol Hőerőmű
Document5 pages
Pillepalackbol Hőerőmű
Katona imre
Pas encore d'évaluation
Red Blood Cell Transfusion Pocket Guide
Document37 pages
Red Blood Cell Transfusion Pocket Guide
Katona imre
Pas encore d'évaluation
Device Model Recall Notification Letter - Slides
Document24 pages
Device Model Recall Notification Letter - Slides
Katona imre
Pas encore d'évaluation
GFI #64 - VICH GL2 Methodology
Document14 pages
GFI #64 - VICH GL2 Methodology
Ruben Santiago-Adame
Pas encore d'évaluation
2012 Red Blood Cell Transfusion Pocket Guide
Document4 pages
2012 Red Blood Cell Transfusion Pocket Guide
Andrey Setiawan
Pas encore d'évaluation
Commodore 64 Reference Guide
Document3 pages
Commodore 64 Reference Guide
scottmac67
Pas encore d'évaluation
GLP Quality Audit Manual: Milton A. Anderson
Document10 pages
GLP Quality Audit Manual: Milton A. Anderson
Katona imre
0% (1)
Bard Q12017 April26
Document10 pages
Bard Q12017 April26
Katona imre
Pas encore d'évaluation
Platelet Aggregation and Quality Control of Platelet Concentrates Produced PDF
Document5 pages
Platelet Aggregation and Quality Control of Platelet Concentrates Produced PDF
Katona imre
Pas encore d'évaluation
For Posting - Final Guidance Injector June 2013
Document29 pages
For Posting - Final Guidance Injector June 2013
Katona imre
Pas encore d'évaluation
UCM471276
Document30 pages
UCM471276
Eckhard
Pas encore d'évaluation
Commodore 64 Reference Guide
Document3 pages
Commodore 64 Reference Guide
scottmac67
Pas encore d'évaluation
Micropitting Can Lead To Macro Problems
Document2 pages
Micropitting Can Lead To Macro Problems
Anonymous alQXB11EgQ
Pas encore d'évaluation
M44 40series
Document9 pages
M44 40series
Kaspar Rapsak
Pas encore d'évaluation
DrillersManual Chapters 1 12
Document192 pages
DrillersManual Chapters 1 12
Hugo Morales
Pas encore d'évaluation
Compendium Templates Nutrition Facts Tables
Document86 pages
Compendium Templates Nutrition Facts Tables
FerociousWolf
Pas encore d'évaluation
Paediatric Doses of Drugs
Document2 pages
Paediatric Doses of Drugs
umapathisivan
Pas encore d'évaluation
9472761
Document23 pages
9472761
Emerson Kohlrausch
Pas encore d'évaluation
In The Name of God
Document34 pages
In The Name of God
Fariha Norman
Pas encore d'évaluation
Equations of State
Document33 pages
Equations of State
Devika Bharathan
Pas encore d'évaluation
Brochure Coating Construction
Document16 pages
Brochure Coating Construction
ALİ ÖRS
Pas encore d'évaluation
It Thesis
Document59 pages
It Thesis
roneldayo62
100% (2)
Construction and Building Materials: Baojian Zhan, Chi Sun Poon, Qiong Liu, Shicong Kou, Caijun Shi
Document5 pages
Construction and Building Materials: Baojian Zhan, Chi Sun Poon, Qiong Liu, Shicong Kou, Caijun Shi
Sara_Parker
Pas encore d'évaluation
METALS Presentation
Document28 pages
METALS Presentation
Theresa Tuliao
Pas encore d'évaluation
008x TF-compressed PDF
Document66 pages
008x TF-compressed PDF
José Francisco Blanco Villalba
Pas encore d'évaluation
Introduction To GFRC
Document3 pages
Introduction To GFRC
Fred Victor
Pas encore d'évaluation
DIPPR Physical Properties Database
Document8 pages
DIPPR Physical Properties Database
Omar Almonte
Pas encore d'évaluation
Drilling Engineering Fluid Properties
Document29 pages
Drilling Engineering Fluid Properties
Deepak Rana
Pas encore d'évaluation
v3.3 - Cut-Off - Activated Carbon Production Granular From Hard Coal - Rer - Activated Carbon Granular 1
Document14 pages
v3.3 - Cut-Off - Activated Carbon Production Granular From Hard Coal - Rer - Activated Carbon Granular 1
jim
Pas encore d'évaluation
USP <1115> Impact on Bioburden Control
Document65 pages
USP <1115> Impact on Bioburden Control
Blank Backtobasic
100% (1)
Astm F139
Document5 pages
Astm F139
diegomez84
Pas encore d'évaluation
Comparing antioxidant assays for estimating activity in guava extracts
Document7 pages
Comparing antioxidant assays for estimating activity in guava extracts
Fira Kuswandari
Pas encore d'évaluation
NEUROPHYSIOLOGY
Document224 pages
NEUROPHYSIOLOGY
Kheliwi
100% (2)
How Do Water Softeners Work
Document3 pages
How Do Water Softeners Work
nermeen ahmed
Pas encore d'évaluation
Phosphate solubilizing bacteria promote tomato growth
Document10 pages
Phosphate solubilizing bacteria promote tomato growth
Vijay Singh Kunwar
Pas encore d'évaluation
10th STD Science Carbon and Its Compounds Lesson Plan Eng Version 2017-18
Document5 pages
10th STD Science Carbon and Its Compounds Lesson Plan Eng Version 2017-18
vijos16655
Pas encore d'évaluation
NAF-Check Tilting Disc Check Valves FK 30.70 (11) GB: Characteristics
Document8 pages
NAF-Check Tilting Disc Check Valves FK 30.70 (11) GB: Characteristics
Roboven
Pas encore d'évaluation
The Concrete Producer Article PDF - Comparing The Options For Cooling Concrete
Document4 pages
The Concrete Producer Article PDF - Comparing The Options For Cooling Concrete
arangar1
100% (1)
Evacuated Tube System
Document2 pages
Evacuated Tube System
Aaron James Ruedas
Pas encore d'évaluation
Confined Space
Document31 pages
Confined Space
gshdavid
Pas encore d'évaluation
Tài liệu ôn tập tiếng anh 4
Document7 pages
Tài liệu ôn tập tiếng anh 4
Ngọc Amii
Pas encore d'évaluation
Cap Trade
Document8 pages
Cap Trade
Ekopribadi
Pas encore d'évaluation