CSC_MidtermPrep_02

  1. What does the problem set P consist of?
    The set P is the set of all decision problems that can be solved by polynomial-time algorithms

    all problems with poly-time solutions (sorting, etc)
  2. What does the problem set NP consist of?
    Problems that can be checked in polynomial time
  3. Illustrate how we think the relationship between P and NP works:
    Image Upload 2


    We know that P is a subset of NP

    We don’t know if it is a proper subset
  4. How could we prove P ≠ NP?
    we would have to find a single problem in NP that was not in P
  5. What does the set NP-complete help us prove?
    This set will help us in attempting to prove or disprove that P = NP
  6. Examples of the 21 problems Karp came up with as part of the NP-complete set?
    • Directed/undirected Hamiltonian circuit
    • Knapsack problem
    • vertex cover
    • steiner cover
    • Hitting tree
    • Max cut
  7. Describe a reduction algorithm for the following scenario:


    Suppose we want to solve decision problem A, and we have an algorithm that solves decision problem B
    Suppose we can write an algorithm that creates an instance y of problem B for every instance x of problem A s.t.

    The algorithm for B answers yes for y iff the answer to problem A is yes for x

    Such an algorithm is called a reduction algorithm
  8. When can we say that A is poly-time reducible to B (or A is reducible to B)?
    Image Upload 4
    (A is proportional to B)
    If there exists a poly-time algorithm from decision problem A to decision problem B, then we say A is poly-time reducible to B (or A reduces to B)

    Further, if B is in P and A is proportional to B, then A is also in P
  9. What sort of ends do the following restriction enzymes produce?
    Image Upload 6
    Image Upload 8
  10. In what organism were restriction enzymes first discovered?
    E.coli
  11. Gel electrophoresis of DNA sample following digestion with EcoRI
    Image Upload 10
    Image Upload 12
  12. What is the use of restriction mapping?
    Restriction maps are a cheap alternative to sequencing for unknown sequences
  13. What is an application of restriction mapping?
    • Overlap detection
    • Image Upload 14
  14. What is an NP complete problem
    NP-complete problem, any of a class of computational problems for which no efficient solution algorithm has been found.

    There exists no poly-time algorithm to solve it
  15. What is the classification of the Double Digest problem?
    DDP is NP-complete

    • Image Upload 16
    • If and only if there is a solution for this DDP, then there is a solution for the original SPP (set partitioning problem). This means SPP can be reduced in poly-time to DDP and DDP is NP-complete
  16. Describe what happens during a partial digest:
    • DNA is exposed to a restriction enzyme for only a limited amount of time to prevent it from being cut at all restriction sites
    • All possible restriction fragments between every two (not necessarily consecutive) cuts are generated
    • The resulting set of fragment sizes is used to determine the positions of the restriction sites in the DNA sequence
  17. Partial Digest fundamentals:
    X=
    n=
    DX=
    Image Upload 18
  18. Formulation of the Partial Digest Problem
    Image Upload 20
  19. Define: Brute force Algorithm
    Also known as exhaustive search algorithms; examine every possible variant to find a solution

    Efficient in rare cases; usually impractical
  20. Partial Digest Brute Force algorithm
    Image Upload 22
  21. What is the efficiency of BruteForcePDP? How might we improve it?
    BruteForcePDP takes Image Upload 23 time since it must examine all possible sets of positions.

    One way to improve the algorithm is to limit the values of xi to only those values which occur in L.
  22. How did Skinea's Algorithm try to solve the PDP?
    Given PDP data, focus on the largest fragment
  23. How does Skinea's algorithm perform?
    Image Upload 25
  24. What is a megavirus?
    • Special case of Mimi virus
    • It is a very big virus with a small genome
  25. When to use recursion vs iteration?
    If time complexity is the point of focus, and the number of recursive calls would be large, it is better to use iteration. However, if time complexity is not an issue and shortness of code is, recursion would be the way to go.
  26. How do you test runtime of your function?
    • the benchmark function!
    • @benchmark test_function()
    • Image Upload 27
  27. Turnip vs Cabbage gene order comparison
    Image Upload 29
  28. Harmonic series=
    Image Upload 30
  29. geometric series:
    Image Upload 32
  30. Name and define four different genome rearrangement operations

    example sequence: 12345
    Reversal: 12345 -> 12435

    Translocation: 123 & 456 -> 126 & 453

    Fusion: 12 & 345 -> 12345

    Fission: 12345 -> 12 & 345
  31. genome rearrangement:
    Image Upload 34
    Image Upload 36
  32. genome rearrangement:
    Image Upload 38
    Image Upload 40
  33. genome rearrangement:
    Image Upload 42
    Image Upload 44
  34. genome rearrangement:
    Image Upload 46
    Image Upload 48
  35. What is a synteny block?
    block of color describing gene similarity on a genome map
  36. Diseases associated with genome rearrangement (3):
    • Robertsonian Translocation (translocation between 13 and 14, often pregnancy loss or fetal abnormality)
    • Philadelphia Chromosome (translocation between chromosome 9 and 22, often myelogenous leukemia)
    • Colon cancer
  37. Reversal Distance Problem:
    Goal=?
    Input=?
    output=?
    Image Upload 50
  38. How do we measure performance of an algorithm?
    Time, space
  39. Greedy reversal sort algorithm:
    Image Upload 52
  40. What is the purpose of an approximation algorithm
    These algorithms find approximate solutions rather than optimal solutions
  41. Approximation ratio of algorithm A on the input Image Upload 53 is:
    Image Upload 55
  42. For the approximation ratio, if you were trying to minimize the problem, which part of the ratio would be larger?
    Image Upload 57
    • A(Image Upload 58 )
    • Minimization is larger than the optimum
  43. What is APX?
    • APX is the set of NP problems that have a P-time approximation algorithm with constant approximation ratio
    • "My solution is not optimal, but it is not worse than 1.1 (10%) off of optimal"
  44. For algorithm A that minimizes objective function (minimization algorithm):
    Image Upload 60
  45. BreakPointReversal search attempts to (eliminate _#_ of breakpoints every _#_ steps) and will not use more than _#_ the minimum number of reversals)
    At least one breakpoints every two steps, and no more than four times the minimum number of reversals
  46. Evaluate performance guarantee of the BreakpointReversalSort algorithm

    hint: Image Upload 61
    Performance guarantee:

    Image Upload 62
  47. In the DCJ model, how is the genome grouped?
    into chromosomes, linear/circular
  48. Image Upload 64
    (2,3,-4)
    Image Upload 66
  49. Image Upload 68
    Image Upload 70
  50. Image Upload 72
    Image Upload 74
Author
saucyocelot
ID
362929
Card Set
CSC_MidtermPrep_02
Description
More midterm prep!
Updated