100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Summary Example of all possible calculations in the exams

Rating
4,0
(1)
Sold
3
Pages
2
Uploaded on
26-12-2020
Written in
2020/2021

Example of all possible calculations in the exams









Whoops! We can’t load your doc right now. Try again or contact support.

Document information

Summarized whole book?
Yes
Uploaded on
December 26, 2020
Number of pages
2
Written in
2020/2021
Type
Summary

Content preview

Assume that eukaryotes have approximately 24,000 protein-coding
genes and that an
average eukaryotic protein is 375 amino acids long. What would be the
total length (in
Mbp) occupied by protein-coding genes in an average eukaryotic
genome?
24,000 genes x 375 amino acids x 3 bases/amino acid = 27,000,000 bp = 27
Mbp

Consider the formulae below and answer the questions that follow.
Coverage, c = [(number of reads, N) x (length of a read, L)]/(genome length, G)
Coverage, c = NL/G
Probability that a base is not sequenced, P = e -c, where e is the base of natural
logarithms with a
constant value of 2.718
Total expected gap length = G x e-c
Total number of gaps = Ne-c
A genome has the size of 4,459 Mbps. The genome was sequenced
through random 300 bp
fragments to yield 92.49 million reads. 1 Mbp = 106 bp.

What coverage does the sequences generated above represent? (4) (2
decimals)
Coverage, c = NL/G
= (92.49 x 106)(300)/(4,459 x 106)
= 6.22
3. What is the probability that a specific base was not sequenced? (2) (3
decimals)
Probability that a base is not sequenced, P = e -c
= 2.718-6.22
= 0.002
4. How many gaps would you expect in the assembled sequences? (2)
What total gap length
would you expect in the assembled sequences? (2) (2 decimals and
Mbp)
Total number of gaps = Ne-c
= (92.49 x 106)(2.718-6.22)
= 184 104
Total expected gap length = G x e-c
= (4,459 x 106)(2.718-6.22)
= 8 875 766.70 bp
= 8.88 Mbp
5. If the gap length is to be limited to 2 Mbp following sequence
assembly, what coverage
should you aim for during sequencing? (4) (2 decimals)
Total expected gap length = G x e-c
2 x 106 = (4,459 x 106) x e-c
e-c = (2 x 106)/(4,459 x 106)
ln e-c = ln (2 x 106)/(4,459 x 106)
c = 7.71
How many reads will you need to produce if you intend to have no more
than 1000 gaps at twelve-fold coverage? (3)

Reviews from verified buyers

Showing all reviews
4 year ago

4,0

1 reviews

5
0
4
1
3
0
2
0
1
0
Trustworthy reviews on Stuvia

All reviews are made by real Stuvia users after verified purchases.

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
charneb1 University of the Freestate
View profile
Follow You need to be logged in order to follow users or courses
Sold
25
Member since
4 year
Number of followers
9
Documents
17
Last sold
2 year ago

4,8

5 reviews

5
4
4
1
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can immediately select a different document that better matches what you need.

Pay how you prefer, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card or EFT and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions