Method for detecting copy number variation of gene long fragment
1. A method for detecting copy number variation of a long fragment of a gene, comprising the steps of:
(1) extracting human genome DNA of a sample to be detected by adopting a DNA extraction kit;
(2) adding an artificial internal standard by taking human genome DNA of a sample to be detected as a template, and obtaining an amplification product of the human genome DNA and an amplification product of the artificial internal standard through multiple PCR (polymerase chain reaction) amplification reaction;
(3) making the product of the amplification reaction pass through shrimp alkaline phosphatase reaction to remove redundant deoxyribonucleoside triphosphate;
(4) extending a base at the 3' end of the extension primer by the product of the step (3) through an extension reaction to obtain an extension product of the human genome DNA and an extension product of the artificial internal standard;
(5) purifying the product of the extension reaction by resin, desalting;
(6) spotting the desalted product on a mass spectrum chip substrate for co-crystallization, transferring the crystal into a vacuum tube of a mass spectrometer, and exciting by laser to perform MassARRAY flight time mass spectrum detection;
(7) according to FREQ data obtained by MassARRAY flight time mass spectrum detection, calculating a copy number variation coefficient (TR value), and judging the copy number of the gene segment.
2. The method of claim 1, wherein the copy number variation of the long fragment of the gene is detected by: the artificial internal standard in the step (2) is a synthesized double-stranded DNA fragment or a plasmid containing the synthesized double-stranded DNA fragment; wherein the synthesized double-stranded DNA fragment is only 1 base different from the target amplified fragment of the sample to be detected, and the different bases on the two fragments are adjacent to the 3' end of the extension primer.
3. The method of claim 1, wherein the copy number variation of the long fragment of the gene is detected by: the amplification product of the human genome DNA and the amplification product of the artificial internal standard obtained in the step (2) both comprise a target detection gene fragment and a reference gene fragment; the length of the target detection gene fragment is 70-600 bp; the reference gene is one or more of RNaseP, EIF2C1, ALB and GAPDH.
4. The method of claim 1, wherein the copy number variation of the long fragment of the gene is detected by: in the step (2), the system of the multiplex PCR amplification reaction comprises PCR Buffer and MgCl2dNTPs, dUTP, PCR primers Pool, UNG enzyme and PCR enzyme.
5. The method of claim 4, wherein the copy number variation of the long fragment of the gene is detected by: the PCR primer Pool is a mixture of a target detection gene amplification primer and an internal reference gene amplification primer.
6. The method of claim 1, wherein the copy number variation of the long fragment of the gene is detected by: in the step (3), the shrimp alkaline phosphatase reaction system comprises SAP enzyme and SAP Buffer.
7. The method of claim 1, wherein the copy number variation of the long fragment of the gene is detected by: in the step (4), the extension reaction system comprises Buffer Plus, Termination Mix, extension primer mixture and iPLEXEnzyme; wherein the extension primer mixture is matched with the sequences of the amplification products of the human genome DNA and the amplification products of the artificial internal standard in the step (2), and any extension primer in the extension primer mixture extends one base at the 3' end to obtain a corresponding extension product.
8. The method of claim 1, wherein the copy number variation of the long fragment of the gene is detected by: in the step (7), two mass spectrum peaks are obtained according to MassARRAY flight time mass spectrum detection, and the FREQ data of the two mass spectrum peaks are calculated according to the following formula to obtain a copy number variation coefficient (TR value); wherein, the two mass spectrum peaks are the peak of a human genome DNA (gDNA) fragment and the peak of an artificial internal standard fragment respectively; the copy number of the artificial internal standard of the target detection gene fragment and the copy number of the artificial internal standard of the internal reference gene fragment are the copy numbers added into the system;
according to the copy number variation coefficient TR value, when TR is less than or equal to 0.65, the target detection gene fragment of the sample to be detected is deletion positive; when TR is more than 0.8 and less than 1.2, the copy number of the target detection gene fragment of the sample to be detected is normal, and the result is negative; when TR is more than or equal to 1.5, the target detection gene fragment of the sample to be detected is repeatedly positive.
9. A kit for use in a method for detecting copy number variation of a long fragment of a gene according to any one of claims 1 to 8, wherein: comprises a sample detection hole W1 and a sample detection hole W2; the sample detection hole W1 contains a W1 multiplex PCR reaction system, an SAP reaction system and a W1 extension reaction system; the W1 multiplex PCR reaction system comprises PCR Buffer and MgCl2dNTPs, dUTP, PCR primer Pool, artificial internal standard, UNG enzyme and PCR enzyme; wherein the sequence of the artificial internal standard is shown as SEQ ID NO: 100-114; the sample detection hole W2 contains a W2 multiplex PCR reaction system, an SAP reaction system and a W2 extension reaction system; the W2 multiplex PCR reaction system comprises PCR Buffer and MgCl2dNTPs, dUTP, PCR primer Pool, artificial internal standard, UNG enzyme and PCR enzyme; wherein the sequence of the artificial internal standard is shown as SEQ ID NO: 115-129.
10. The kit of claim 9, wherein: the sequence of the PCR primer Pool in the W1 multiplex PCR reaction system is shown as SEQ ID NO. 34-65; the W1 extension reaction system comprises Buffer Plus, Termination Mix, W1 extension primer mixture and iPLEX Enzyme; the sequence of the W1 extension primer mixture is shown as SEQ ID NO. 1-16; the sequence of the PCR primer Pool in the W2 multiplex PCR reaction system is shown as SEQ ID NO. 66-99; the W2 extension reaction system comprises Buffer Plus, Termination Mix, W2 extension primer mixture and iPLEX Enzyme; the sequence of the W2 extension primer mixture is shown as SEQ ID NO. 17-33; the SAP reaction system includes an SAP enzyme and an SAP Buffer.
Background
There are a wide range of variations in the human genome, including Single Nucleotide Polymorphisms (SNPs), variable number tandem repeat polymorphisms (VNTRs), and alterations in gene structure. The genomic structural variation can be divided into two levels according to size: microscopic level and sub-microscopic level. Microscopic level mainly refers to chromosome aberration visible under a microscope, including structural variations such as euploid, aneuploidy, deletion, insertion, inversion, translocation, fragile site and the like. The sub-microscopic level refers to the genomic structural variation of DNA fragments with length of 1Kb to several Mb, including deletion, duplication, insertion, inversion, translocation, and complex rearrangement, which are collectively called large fragment rearrangement (LGR). And those in which a change in the number of copies can be caused are called Copy Number Variation (CNV). It has been estimated by research that CNV accounts for at least 12% of the genome and has become a further important source of genomic polymorphisms.
Gene mutations may result in protein truncation, disruption of mRNA processing, or amino acid substitutions that have a significant effect on protein function, thereby causing various diseases. The detection of gene mutation is of great significance to the prevention, treatment and prognosis monitoring of human diseases. For example, the mutation frequency of the breast cancer susceptibility gene BRCA1/2 in Chinese familial breast cancer is 10.5%, and the mutation frequency of BRCA1/2 in Caucasian familial breast cancer is 15% -20%. The incidence frequency of BRCA1/2 gene Copy Number Variation (CNV) in different region population is 0.2% -12.2%, accounting for 0.9% -21.4% of BRCA1/2 total pathogenic gene mutation.
Most point mutations and insertions of short fragments are easily detected by PCR and sequencing. However, long fragment Copy Number Variations (CNV) of DNA greater than 1Kb are currently difficult to detect by sequencing. The existing methods for detecting CNV are few, and the most used method is multiplex ligation-dependent probe amplification (MLPA). However, the method has the advantages of short covered fragment, high cost, large sample consumption and small flux. Therefore, it is urgently needed to explore a new method capable of detecting CNV with low cost and high flux.
Disclosure of Invention
The invention aims to provide a novel method for detecting copy number variation of long segments of genes. The method creatively adds an artificial internal standard (completer), adopts specific PCR primers and extension primers, and is based on multiple PCR reaction of MassARRAY flight time mass spectrum, so that the sample can be detected at high flux, and the cost is reduced. The copy number variation result provides gene detection information more comprehensively, and is helpful for assisting a doctor in diagnosis.
In order to achieve the above object, the present invention comprises the steps of:
(1) extracting human genome DNA of a sample to be detected by adopting a commercial DNA extraction kit;
(2) taking human genome DNA of a sample to be detected as a template, adding an artificial internal standard (synthesizer), and simultaneously obtaining an amplification product of the human genome DNA and an amplification product of the artificial internal standard through multiple PCR amplification reaction;
the added artificial internal standard (synthesizer) is a synthesized double-stranded DNA fragment or a plasmid containing the synthesized double-stranded DNA fragment. Wherein the synthesized double-stranded DNA fragment is only 1 base different from the target amplified fragment of the sample to be detected, and the different bases on the two fragments are adjacent to the 3' end of the extension primer.
The amplified products of the human genome DNA and the amplified products of the artificial internal standard obtained by the reaction both comprise target detection gene fragments and reference gene fragments. The length of the target detection gene fragment is 70-600 bp; the reference gene is one or more of RNaseP, EIF2C1, ALB and GAPDH.
The system of the multiplex PCR amplification reaction comprises PCR Buffer and MgCl2dNTPs, dUTP, PCR primers Pool, UNG enzyme and PCR enzyme; and a negative sample needs to be set for each experiment.
PCR primer Pool is a specific target fragment amplification primer mixture. The PCR primer Pool is a mixture of a target detection gene amplification primer and an internal reference gene amplification primer. One PCR primer Pool contains a plurality of pairs of primers, and the number of the primers is less than or equal to 30. If the number of the primers exceeds 30, dividing the primers into 2 or more PCR primers Pool, and detecting by using 2 or more sample detection holes;
(3) the product of the amplification reaction is reacted with Shrimp Alkaline Phosphatase (SAP) to remove excess deoxyribonucleoside triphosphate (dNTP/dUTP). The Shrimp Alkaline Phosphatase (SAP) reaction system comprises SAP enzyme and SAP Buffer;
(4) and (3) extending the product of the step (3) by a base at the 3' end of the extension primer through an extension reaction to obtain an extension product of the human genome DNA and an extension product of the artificial internal standard.
The extension reaction system comprises Buffer Plus, Termination Mix, extension primer mixture and iPLEX Enzyme.
And (3) the extension primer mixture is matched with the sequences of the amplification products of the human genome DNA and the amplification products of the artificial internal standard in the step (2), and any extension primer in the extension primer mixture extends one base at the 3' end to obtain a corresponding extension product.
An extension primer mixture comprises a plurality of primers, and the number of the primers is less than or equal to 30. If more than 30 primers are used, dividing the mixture into 2 or more extended primers, and detecting the mixture by using 2 or more sample detection holes;
(5) purifying the product of the extension reaction by resin, and desalting;
(6) spotting the desalted product on a mass spectrum chip substrate for co-crystallization, transferring the crystal into a vacuum tube of a mass spectrometer, and exciting by laser to perform MassARRAY flight time mass spectrum detection;
(7) and calculating the TR value of the copy number variation coefficient according to the FREQ data obtained by MassARRAY flight time mass spectrum detection, and judging the copy number of the gene fragment.
Purifying the extension product and performing MassARRAY flight time mass spectrum detection to obtain two mass spectrum peaks, and calculating by using FREQ data of the two mass spectrum peaks according to the following formula to obtain a copy number variation coefficient (TR value); wherein the two mass spectrum peaks are the peak of a human genome DNA (gDNA) fragment and the peak of an artificial internal standard fragment respectively; the copy number of the artificial internal standard of the target detection gene fragment and the copy number of the artificial internal standard of the internal reference gene fragment are the copy numbers added into the system.
Because there are n reference genes, the copy number ratio of each sample to be tested to each negative sample has n numerical values, and the median is taken. Since there are n negative samples, n medians are obtained, and the final copy number coefficient of variation (TR value) is obtained by averaging.
According to the copy number variation coefficient (TR value), when the TR is less than or equal to 0.65, the target detection gene fragment of the sample to be detected is deletion positive; when TR is more than 0.8 and less than 1.2, the copy number of the target detection gene fragment of the sample to be detected is normal, and the result is negative; when TR is more than or equal to 1.5, the target detection gene fragment of the sample to be detected is repeatedly positive.
A kit based on the method for detecting the copy number variation of the gene long fragment comprises a sample detection hole W1 and a sample detection hole W2; the sample detection hole W1 comprises a W1 multiplex PCR reaction system, a Shrimp Alkaline Phosphatase (SAP) reaction system and a W1 extension reaction system; the sample detection well W2 includes a W2 multiplex PCR reaction system, a Shrimp Alkaline Phosphatase (SAP) reaction system, and a W2 extension reaction system.
The W1 multiplex PCR reaction system comprises PCR Buffer and MgCl2dNTPs, dUTP, PCR primers Pool, artificial internal standard (compositor), UNG enzyme and PCR enzyme.
Further, the sequence of the PCR primer Pool in the W1 multiplex PCR reaction system is as follows:
further, the sequences of the artificial internal standards in the W1 multiplex PCR reaction system are as follows:
the W2 multiplex PCR reaction system comprises PCR Buffer and MgCl2dNTPs, dUTP, PCR primer Pool, artificial internal standard, UNG enzyme and PCR enzyme.
Further, the sequence of the PCR primer Pool in the W2 multiplex PCR reaction system is as follows:
F17
GTTGGATGTGTGAATTTTCTGAGACGGATG
SEQ ID NO:66
F18
ACGTTGGATGTGACGTGTCTGCTCCACTTC
SEQ ID NO:67
F19
GGATGTTTGTTTTCTCATTCCATTTAAAGC
SEQ ID NO:68
F20
CGTTGGATGTTTTTAGCAAAAGCGTCCAGA
SEQ ID NO:69
F21
TGGATGTCTCCTGAACATCTAAAAGATGAA
SEQ ID NO:70
F22
ACGTTGGATGCAGAGGGATACCATGCAACA
SEQ ID NO:71
F23
CGTTGGATGAAAGCAGATTTGGCAGTTCAA
SEQ ID NO:72
F24
GTTGGATGAACTTCTCCCATTCCTTTCAGA
SEQ ID NO:73
F25
ACGTTGGATGGGTAGAGGGCCTGGGTTAAG
SEQ ID NO:74
F26
ACGTTGGATGGCTCTTTAGCTTCTTAGGAC
SEQ ID NO:75
F27
TGGTTTTTCTAATGTGTTAAAGTTCATTGG
SEQ ID NO:76
F28
ACGTTGGATGTTTCTCAGATAACTGGGCCCCTG
SEQ ID NO:77
F29
ACGTTGGATGTTCCCCTGTCCCTCTCTCTT
SEQ ID NO:78
F30
CGTTGGATGTTCAGGAGGAAAAGCACAGAA
SEQ ID NO:79
R17
ACGTTGGATGGCTGTAATGAGCTGGCATGA
SEQ ID NO:80
R18
ACGTTGGATGTGTCCTGGGATTCTCTTGCT
SEQ ID NO:81
R19
CGTTGGATGCCTTTCCACTCCTGGTTCTTT
SEQ ID NO:82
R20
ACGTTGGATGAAGTGTTGGAAGCAGGGAAG
SEQ ID NO:83
R21
ACGTTGGATGGCCACAGAGCAAGACTCCAT
SEQ ID NO:84
R22
ACGTTGGATGTTTGGCCAACAATACACACC
SEQ ID NO:85
R23
GTTGGATGGCTTTCGTTTTGAAAGCAGATT
SEQ ID NO:86
R24
ACGTTGGATGTGACTCTGGGGCTCTGTCTT
SEQ ID NO:87
R25
ACGTTGGATGTGTGTCCTCCCTCTCTGACA
SEQ ID NO:88
R26
ACGTTGGATGCATTTTCCTCCCGCAATTCC
SEQ ID NO:89
R27
ACGTTGGATGCAGAGTGGATGGAGAACAAGG
SEQ ID NO:90
R28
ACGTTGGATGCCTCTCAGGTTCCGCCCC
SEQ ID NO:91
R29
ACGTTGGATGGGTTCTCCCAGGCTCTTACC
SEQ ID NO:92
R30
CGTTGGATGCCTGAGACCCTTACCCAATTC
SEQ ID NO:93
F31
ACGTTGGATGGGTCAGCTCTTCCCTTCATC
SEQ ID NO:94
F32
ACGTTGGATGGTGGTTCGGCTTTCACCAGTCTG
SEQ ID NO:95
F33
ACGTTGGATGAGCAACCTGTTACATATTAAAGTT
SEQ ID NO:96
R31
ACGTTGGATGCCTCCCACATGTAATGTGTTG
SEQ ID NO:97
R32
ACGTTGGATGGGTGTGGTCACTGGACTTGGG
SEQ ID NO:98
R33
ACGTTGGATGATACTGAGCAAAGGCAATCAAC
SEQ ID NO:99
further, the sequences of the artificial internal standards in the W2 multiplex PCR reaction system are as follows:
the SAP reaction systems of W1 and W2 both included SAP enzyme and SAP Buffer.
The W1 extension reaction system comprises Buffer Plus, Termination Mix, W1 extension primer mixture and iPLEX Enzyme; the W2 extension reaction system included Buffer Plus, Termination Mix, W2 extension primer Mix, and iPLEX Enzyme.
Further, the sequences of the W1 extension primer mixture are as follows:
UEP1
GCTGAAACTTCTCAACC
SEQ ID NO:1
UEP2
CTGGGAGCTCCTCTCACTC
SEQ ID NO:2
UEP3
GAGTTGATCAAGGAACCTG
SEQ ID NO:3
UEP4
CACAGCAGAAACCTACAACTC
SEQ ID NO:4
UEP5
AAAGGCATCTGGCTGCACAACC
SEQ ID NO:5
UEP6
GGGTGAGAGGATAGCCCTGAGC
SEQ ID NO:6
UEP7
ACAAGAAAGTACGAGATTTAGTC
SEQ ID NO:7
UEP8
ATTCAGCATTTTTCTTTCTTTAAT
SEQ ID NO:8
UEP9
ATTAGATTAGTTAAAGTGATGTGG
SEQ ID NO:9
UEP10
TGGAGATCAAGAATTGTTACAAATC
SEQ ID NO:10
UEP11
GGATGTTCTCATTTCCCATTTCTCTT
SEQ ID NO:11
UEP12
CCCTCTGAAGATACCGTTAATAAGGC
SEQ ID NO:12
UEP13
TGAGATGGGTAGTTTCTATTCTGAAG
SEQ ID NO:13
UEP14
CACCCTTCCATCATAAGTGAC
SEQ ID NO:14
UEP15
TCAACATTGATGGTG
SEQ ID NO:15
UEP16
GGGAAACAAAGGACACCGTTA
SEQ ID NO:16
the sequences of the W2 extension primer mixture were as follows:
compared with the prior art, the invention has the following beneficial effects because the technology is adopted:
the invention successfully establishes a new method for detecting the copy number variation of the gene long fragment, and the application of the invention to the detection of the copy number variation of the gene long fragment can cover longer detection sequences, can simultaneously carry out 384 detections, and has the characteristics of large flux, high accuracy, low cost and the like.
Drawings
FIG. 1 is a graph showing the results of MassARRAY in which a gene fragment is deleted;
FIG. 2 is a graph showing the result of MassARRAY in which gene fragments are duplicated;
FIG. 3 is a histogram of the results of gene fragment deletions;
FIG. 4 is a histogram of the results of gene fragment duplication;
FIG. 5 is a schematic diagram showing a comparison of an artificial internal standard with human genomic DNA, in which the extension position at the 3' end of the extension primer differs by one base;
FIG. 6 is a schematic diagram of an extended product peak (artificial internal standard peak) of an artificial internal standard and an extended product peak (target peak) of human genomic DNA.
Detailed Description
The present invention will be further illustrated with reference to the following specific embodiments.
Example 1
Detection of copy number variation of the long fragment of the BRCA1 gene:
1) preparation of DNA samples
Extraction of human genomic DNA Using a commercial DNA extraction kit. The quality of the obtained DNA is detected by using Nano300 to meet OD260/OD280Between 1.8 and 2.0, the concentration is more than 10 ng/mu l. The qualified DNA samples were diluted to 10 ng/. mu.l and kept at 4 ℃ until use.
2) PCR amplification reaction
Preferred amplification primer combinations for this application are shown in tables 1-4, and the amplification primers are specific for 23 exons of reference gene and BRCA1 gene. The target segments of the internal reference gene amplification primer and the extension primer are one segment of the human genes RNase P, EIF2C1 and ALB. Due to more than 30 primer pairs, each sample was tested in 2 wells, W1 and W2, respectively. Mixing 16 pairs of primers in W1 according to a final concentration of 0.5 mu M respectively to prepare a PCR primer Pool of W1; 17 pairs of primers in W2 were mixed at a final concentration of 0.5. mu.M, respectively, to prepare PCR primers Pool of W2.
TABLE 1 BRCA1 Gene amplification primer sequences of W1
F1
GATGTTTGCCTTTTGAGTATTCTTTCTACA
SEQ ID NO:34
F2
ACGTTGGATGACTCTACCAGTGCCAGGAGC
SEQ ID NO:35
F3
ACGTTGGATGTTCTTTTTCTCCCCCCCTAC
SEQ ID NO:36
F4
ACGTTGGATGGTTGTTCTAGCAGTGAAGAG
SEQ ID NO:37
F5
GGATGGATGAAGTGACAGTTCCAGTAGTCC
SEQ ID NO:38
F6
ACGTTGGATGCACGCTTTTTACCTGAGTGG
SEQ ID NO:39
F7
GTTGGATGCACTTGCTGAGTGTGTTTCTCA
SEQ ID NO:40
F8
CGTTGGATGAAAGAGCACGTTCTTCTGCTG
SEQ ID NO:41
F9
CGTTGGATGTTCTGAGCTGTGTGCTAGAGG
SEQ ID NO:42
F10
GTTGGATGTGGTCAGCTTTCTGTAATCGAA
SEQ ID NO:43
F11
ACGTTGGATGACGGCTAATTGTGCTCACTG
SEQ ID NO:44
F12
GTTGGATGAGGATCTGATTCTTCTGAAGATAC
SEQ ID NO:45
F13
ACGTTGGATGATGAGCTCCTCTTGAGATGG
SEQ ID NO:46
F14
ACGTTGGATGCAGAGGGATACCATGCAACA
SEQ ID NO:47
R1
ACGTTGGATGCCTACTGTGGTTGCTTCCAA
SEQ ID NO:48
R2
ACGTTGGATGATAATTTTGTGCTCATGGCAGA
SEQ ID NO:49
R3
ACGTTGGATGTGGAGCCACATAACACATTC
SEQ ID NO:50
R4
ACGTTGGATGACTCTTCTTGGCTCCAGTTG
SEQ ID NO:51
R5
ACGTTGGATGTACATGCAGGCACCTTACCA
SEQ ID NO:52
R6
ACGTTGGATGTAAGGTGAAGCAGCATCTGG
SEQ ID NO:53
R7
ACGTTGGATGTCCAAACCTGTGTCAAGCTG
SEQ ID NO:54
R8
ACGTTGGATGGGTGCATTGATGGAAGGAAG
SEQ ID NO:55
R9
ACGTTGGATGTTTATGCAGCAGATGCAAGG
SEQ ID NO:56
R10
CGTTGGATGTTTTTGCAGAATCCAAACTGA
SEQ ID NO:57
R11
ACGTTGGATGGCTAGAGGAAAACTTTGAGG
SEQ ID NO:58
R12
ACGTTGGATGGTTCTCTTTGACTCACCTGCAA
SEQ ID NO:59
R13
ACGTTGGATGGTCATCCCCTTCTAAATGCC
SEQ ID NO:60
R14
ACGTTGGATGTTTGGCCAACAATACACACC
SEQ ID NO:61
TABLE 2 reference Gene amplification primer sequences of W1
F15
ACGTTGGATGGTGGTTCGGCTTTCACCAGTCTG
SEQ ID NO:62
F16
ACGTTGGATGGGTCAGCTCTTCCCTTCATC
SEQ ID NO:63
R15
ACGTTGGATGGGTGTGGTCACTGGACTTGGG
SEQ ID NO:64
R16
ACGTTGGATGCCTCCCACATGTAATGTGTTG
SEQ ID NO:65
TABLE 3 BRCA1 Gene amplification primer sequences of W2
F17
GTTGGATGTGTGAATTTTCTGAGACGGATG
SEQ ID NO:66
F18
ACGTTGGATGTGACGTGTCTGCTCCACTTC
SEQ ID NO:67
F19
GGATGTTTGTTTTCTCATTCCATTTAAAGC
SEQ ID NO:68
F20
CGTTGGATGTTTTTAGCAAAAGCGTCCAGA
SEQ ID NO:69
F21
TGGATGTCTCCTGAACATCTAAAAGATGAA
SEQ ID NO:70
F22
ACGTTGGATGCAGAGGGATACCATGCAACA
SEQ ID NO:71
F23
CGTTGGATGAAAGCAGATTTGGCAGTTCAA
SEQ ID NO:72
F24
GTTGGATGAACTTCTCCCATTCCTTTCAGA
SEQ ID NO:73
F25
ACGTTGGATGGGTAGAGGGCCTGGGTTAAG
SEQ ID NO:74
F26
ACGTTGGATGGCTCTTTAGCTTCTTAGGAC
SEQ ID NO:75
F27
TGGTTTTTCTAATGTGTTAAAGTTCATTGG
SEQ ID NO:76
F28
ACGTTGGATGTTTCTCAGATAACTGGGCCCCTG
SEQ ID NO:77
F29
ACGTTGGATGTTCCCCTGTCCCTCTCTCTT
SEQ ID NO:78
F30
CGTTGGATGTTCAGGAGGAAAAGCACAGAA
SEQ ID NO:79
R17
ACGTTGGATGGCTGTAATGAGCTGGCATGA
SEQ ID NO:80
R18
ACGTTGGATGTGTCCTGGGATTCTCTTGCT
SEQ ID NO:81
R19
CGTTGGATGCCTTTCCACTCCTGGTTCTTT
SEQ ID NO:82
R20
ACGTTGGATGAAGTGTTGGAAGCAGGGAAG
SEQ ID NO:83
R21
ACGTTGGATGGCCACAGAGCAAGACTCCAT
SEQ ID NO:84
R22
ACGTTGGATGTTTGGCCAACAATACACACC
SEQ ID NO:85
R23
GTTGGATGGCTTTCGTTTTGAAAGCAGATT
SEQ ID NO:86
R24
ACGTTGGATGTGACTCTGGGGCTCTGTCTT
SEQ ID NO:87
R25
ACGTTGGATGTGTGTCCTCCCTCTCTGACA
SEQ ID NO:88
R26
ACGTTGGATGCATTTTCCTCCCGCAATTCC
SEQ ID NO:89
R27
ACGTTGGATGCAGAGTGGATGGAGAACAAGG
SEQ ID NO:90
R28
ACGTTGGATGCCTCTCAGGTTCCGCCCC
SEQ ID NO:91
R29
ACGTTGGATGGGTTCTCCCAGGCTCTTACC
SEQ ID NO:92
R30
CGTTGGATGCCTGAGACCCTTACCCAATTC
SEQ ID NO:93
TABLE 4 reference Gene amplification primer sequences of W2
F31
ACGTTGGATGGGTCAGCTCTTCCCTTCATC
SEQ ID NO:94
F32
ACGTTGGATGGTGGTTCGGCTTTCACCAGTCTG
SEQ ID NO:95
F33
ACGTTGGATGAGCAACCTGTTACATATTAAAGTT
SEQ ID NO:96
R31
ACGTTGGATGCCTCCCACATGTAATGTGTTG
SEQ ID NO:97
R32
ACGTTGGATGGGTGTGGTCACTGGACTTGGG
SEQ ID NO:98
R33
ACGTTGGATGATACTGAGCAAAGGCAATCAAC
SEQ ID NO:99
The kit for detecting the copy number variation of the long fragment of the BRCA1 gene is provided with an artificial internal standard (synthesizer), wherein the artificial internal standard is a double-stranded DNA synthetic fragment or a plasmid containing the double-stranded DNA synthetic fragment, and the double-stranded DNA synthetic fragment and the target amplified fragment have only 1 base difference. The different base is the 3' -end base (shown in FIGS. 5 and 6) extended by the extension primer during the extension reaction, and serves as an internal control. Specific artificial internal standard sequences are shown in tables 5-6. 15 artificial internal standards of W1 were mixed in amounts corresponding to a final concentration of 3000 copies/. mu.l human genomic DNA; 15 artificial internal standards of W2 were also mixed at a final concentration equivalent to 3000 copies/. mu.l of human genomic DNA. A W1 artificial internal standard mixture and a W2 artificial internal standard mixture were obtained, respectively.
TABLE 5 Artificial internal standard sequence of W1
TABLE 6 Artificial internal standard sequence of W2
The multiple PCR reaction system used by the kit for detecting the copy number variation of the long fragment of the BRCA1 gene contains PCR Buffer and MgCl2dNTPs, UNG enzyme and PCR enzyme (available from agena bioscience, cat # 21327L), and dUTP (available from shanghai megabimei, cat # D5331), PCR primer Pool and artificial internal standard mixture. Each sample was tested in two wells, sample test well W1 and sample test well W2. The PCR amplification reaction system is shown in Table 7, wherein the PCR primer Pool of W1 and the artificial internal standard mixture are used in W1 well, and the PCR primer Pool of W2 and the artificial internal standard mixture are used in W2 well. The PCR amplification reaction procedure is shown in Table 8.
TABLE 7 PCR amplification reaction System
Reagent
Single well volume/. mu.l
ddH2O
6.4
10×PCR Buffer
2
MgCl2(25mM)
3.2
dNTP Mix(25mM)
0.4
dUTP(100mM)
0.1
PCR primer Pool
2.4
UNG Enzyme(5U/μl)
0.5
PCR Enzyme(5U/μl)
1
Artificial internal standard mixture
2
DNA(10ng/μl)
2
Total volume
20
TABLE 8 PCR amplification reaction procedure
3) SAP reaction
The SAP reaction system contained SAP enzyme and SAP Buffer (available from Agena Bioscience, Inc., cat. 10141). After PCR amplification, 5ul of product was taken for SAP reaction. The SAP reaction system is shown in Table 9 and the reaction procedure is shown in Table 10.
TABLE 9 SAP reaction System
Reagent
Single well volume/. mu.l
PCR amplification product
5
ddH2O
1.53
SAP Buffer
0.17
SAP enzymes
0.30
Total volume
7
TABLE 10 SAP reaction procedure
37℃
40min
85℃
5min
4℃
Forever
4) Extension reaction
The above extension reaction system contained Buffer Plus, Termination Mix and iPLEX Enzyme (available from Agena Bioscience, Inc., cat. 10141), and an extension primer mixture. Preferred sequences of extension primers combinations for the present application are shown in tables 12-15, and the extension primers are specific for BRCA1 gene and reference gene. 16 extension primers (UEP) in W1 are mixed according to the rule of Table 11 to prepare a UEP mixture of W1; the 17 extension primers (UEP) in W2 were mixed according to the rules of Table 11 to prepare a UEP mixture of W2. After the completion of the SAP reaction, an elongation reaction was performed. The elongation reaction system is shown in Table 16, wherein the UEP mixture of W1 is used for W1 pores, and the UEP mixture of W2 is used for W2 pores. The reaction procedure is shown in Table 17.
TABLE 11 extended primer (UEP) mixture formulation Table
UEP molecular weight/Da
Final concentration/. mu.M in UEP mixtures
<5000
4
5000-6000
6
6000-7000
8
>7000
12
TABLE 12 BRCA1 Gene extension primer sequences of W1
UEP1
GCTGAAACTTCTCAACC
SEQ ID NO:1
UEP2
CTGGGAGCTCCTCTCACTC
SEQ ID NO:2
UEP3
GAGTTGATCAAGGAACCTG
SEQ ID NO:3
UEP4
CACAGCAGAAACCTACAACTC
SEQ ID NO:4
UEP5
AAAGGCATCTGGCTGCACAACC
SEQ ID NO:5
UEP6
GGGTGAGAGGATAGCCCTGAGC
SEQ ID NO:6
UEP7
ACAAGAAAGTACGAGATTTAGTC
SEQ ID NO:7
UEP8
ATTCAGCATTTTTCTTTCTTTAAT
SEQ ID NO:8
UEP9
ATTAGATTAGTTAAAGTGATGTGG
SEQ ID NO:9
UEP10
TGGAGATCAAGAATTGTTACAAATC
SEQ ID NO:10
UEP11
GGATGTTCTCATTTCCCATTTCTCTT
SEQ ID NO:11
UEP12
CCCTCTGAAGATACCGTTAATAAGGC
SEQ ID NO:12
UEP13
TGAGATGGGTAGTTTCTATTCTGAAG
SEQ ID NO:13
UEP14
CACCCTTCCATCATAAGTGAC
SEQ ID NO:14
Reference Gene extension primer sequences of Table 13.W1
UEP15
TCAACATTGATGGTG
SEQ ID NO:15
UEP16
GGGAAACAAAGGACACCGTTA
SEQ ID NO:16
Table 14.W2 BRCA1 Gene extension primer sequences
UEP17
ACTGAGAAGCGTGCAGC
SEQ ID NO:17
UEP18
CGACCTTGGTGGTTTC
SEQ ID NO:18
UEP19
CCTCAAACTTGTCAGCAGAA
SEQ ID NO:19
UEP20
CTCCGGAGTCCTAGCCCTTTC
SEQ ID NO:20
UEP21
GGTTCACTCTGTAGAAGTCTTT
SEQ ID NO:21
UEP22
CCATGTTCTAACACAGCTTCTAG
SEQ ID NO:22
UEP23
GGAGTCTGCTCCGTTTGGTTAGT
SEQ ID NO:23
UEP24
AACACTTACCTGGAATCTGGAATC
SEQ ID NO:24
UEP25
GGATGGTGAATGATGAAAGCTCCT
SEQ ID NO:25
UEP26
GGGGAGATGCTGAGTTTGTGTGTG
SEQ ID NO:26
UEP27
TAAGAAGTACAAAATGTC
SEQ ID NO:27
UEP28
CCTGTCCCTTTCCCGGG
SEQ ID NO:28
UEP29
TGTTGGCATGTTGGTGAAGGGCCCAT
SEQ ID NO:29
UEP30
CGGGTCCAACTCTCTAACCTTGGAACTG
SEQ ID NO:30
Reference gene extension primer sequences of Table 15.W2
UEP31
TGATAACAAAGGACACCGTTA
EQ ID NO31
UEP32
GGGAGGCTCAACATTGATGGTG
EQ ID NO32
UEP33
TTACTACATTTTTCTACATCCTTTGTTT
EQ ID NO33
TABLE 16 elongation reaction System
Reagent
Single well volume/. mu.l
SAP products
7
ddH2O
0.619
Buffer Plus(10×)
0.2
Termination Mix
0.2
UEP mixture
0.94
iPLEX Pro Enzyme
0.041
Total volume
9
TABLE 17 elongation reaction procedure
5) Desalination
19ul of ultrapure water was added to the product of the elongation reaction, and the mixture was placed in a MassARRAY-CPM machine, and 10ul of resin was added to the machine, desalted and purified, and allowed to stand for settling.
6) Mass spectrometric detection
MassARRAY-CPM automatically spots the product on a chip and moves into a vacuum tube, then uses laser excitation to perform time-of-flight mass spectrometry. A chip can detect 192 samples at most simultaneously, so that the flux is increased, and the cost is reduced.
2. Data analysis
The scatter plot can be analyzed by opening the experimental results with the Typer software. The result of a missing or duplicated exon is scattered outside the pile of normal samples (as circled in FIGS. 1 and 2). We needed further analysis of the Data, clicking the View-Plate Data Panel export Data sheet in the Typer software, resulting in the FREQ value (Table 18). The BRCA1 gene fragment Exon15 is taken as an example.
TABLE 18 FREQ values
Sample name
Detecting fragments
FREQ of gDNA
FREQ with artificial internal standard
Sample 1 to be tested
ALB
0.383216
0.616784
Sample 2 to be tested
ALB
0.361839
0.638161
Negative sample 1
ALB
0.461943
0.538056
Negative sample 2
ALB
0.384088
0.615913
Negative sample 3
ALB
0.356612
0.64339
Sample 1 to be tested
EIF2C1
0.573677
0.426323
Sample 2 to be tested
EIF2C1
0.514017
0.485983
Negative sample 1
EIF2C1
0.627566
0.372434
Negative sample 2
EIF2C1
0.535629
0.464371
Negative sample 3
EIF2C1
0.495153
0.504847
Sample 1 to be tested
RNaseP
0.441841
0.558159
Sample 2 to be tested
RNaseP
0.399637
0.600362
Negative sample 1
RNaseP
0.3908
0.6092
Negative sample 2
RNaseP
0.463194
0.536802
Negative sample 3
RNaseP
0.403123
0.596877
Sample 1 to be tested
Exon15
0.488966
0.511034
Sample 2 to be tested
Exon15
0.340837
0.659161
Negative sample 1
Exon15
0.546832
0.453167
Negative sample 2
Exon15
0.496558
0.503443
Negative sample 3
Exon15
0.438126
0.561872
Copy number was calculated according to the formula below, based on the FREQ parameters of human genomic dna (gdna) and the FREQ parameters of the artificial internal standards for each BRCA1 gene fragment in the data table.
In the embodiment, 3 reference genes exist, the copy number ratio of each sample to be detected to each negative sample has 3 numerical values, and a median is taken. Since there were 3 negative samples, 3 medians were obtained, and the final copy number coefficient of variation (TR value) was obtained by averaging.
Take Exon15 fragment of Well1 as an example, wherein the copy number of the artificial internal standard of the target detection gene fragment and the internal reference gene fragment added in the example is 6000.
1) Negative sample 1 copy number ratio
2) Negative sample 2 copy number ratio
3) Negative sample 3 copy number ratio
4) Copy number ratio of sample 1 to be tested
5) Copy number ratio of sample 2 to be tested
6) Copy number coefficient of variation of sample to be tested
Because 3 reference genes exist, the copy number ratio of each sample to be detected to each negative sample has 3 numerical values, and the median is 0.99.
Because 3 reference genes exist, the copy number ratio of each sample to be detected to each negative sample has 3 numerical values, and the median is 0.97.
Because 3 reference genes exist, the copy number ratio of each sample to be detected to each negative sample has 3 numerical values, and the median is 1.05.
Since there are 3 negative samples, 3 medians were obtained, and the 3 medians were averaged to obtain the final copy number coefficient of variation (TR value).
7) Copy number coefficient of variation of sample 2 to be tested
Because 3 reference genes exist, the copy number ratio of each sample to be detected to each negative sample has 3 numerical values, and the median is 0.65.
Because there are 3 internal reference genes, the copy number ratio of each sample to be tested to each negative sample has 3 numerical values, and the median is 0.58.
Because 3 reference genes exist, the copy number ratio of each sample to be detected to each negative sample has 3 numerical values, and the median is 0.65.
Since there are 3 negative samples, 3 medians were obtained, and the 3 medians were averaged to obtain the final copy number coefficient of variation (TR value).
According to the TR value, the judgment is as follows:
TR value
Result judgment
1
Sample 1 to be tested
0.8<TR<1.2
Exon15 Normal
2
Sample 2 to be tested
TR≤0.65
Exon15 deletion
According to the algorithm, after the RStudio editor is used for compiling codes, experimental data can be directly uploaded to obtain a final TR value. Calculating the copy numbers of other exons of the sample 1 to be detected and the sample 2 to be detected by the same method, wherein the Exon13 copy number variation coefficient (TR value) of the sample 1 to be detected is 1.9, and the result is judged to be that the Exon13 is repeated; the Exon16 copy number variation coefficient (TR value) of the sample 2 to be tested was 0.56, and the result was found to be Exon16 deletion. By making a histogram based on the TR values, the Exon13 histogram of the sample 1 to be tested is significantly higher than the other histograms (as shown in fig. 4). The histograms of Exon15 and Exon16 of sample 2 to be tested were significantly lower than the other histograms (as shown in fig. 3).
3. Quality control standard
Quality control, each test needs 2 positive controls, 3 negative controls and 3 blank controls. TR of the deletion positive control is less than or equal to 0.65, TR of the repeated positive control is more than or equal to 1.5, and TR of the blank control is 0.
TR value
Result judgment
1
TR=0
Blank control
2
TR≤0.65
Deletion positive control
3
TR≥1.5
Duplicate positive controls
4. The product performance index is as follows:
the method can specifically detect 1 or more deletions and repeats of 23 exons of BRCA 1.
Detecting 2 parts of positive reference substances and 10 parts of negative reference substances, wherein the coincidence rate is 100%; the enterprise repeated reference product is repeatedly detected for 10 times, and the results of 10 times are the same. And detecting an unknown sample, wherein the coincidence rate is 98.6%.
The invention develops a new method for detecting the copy number variation of the long segments of the gene, creatively adds an artificial internal standard (completer), adopts a specific PCR primer and an extension primer, and can detect the copy number variation of the long segments of the gene with large flux, high accuracy and low cost by the multiple PCR reaction based on MassARRAY flight time mass spectrum. The kit can make up for most of the loopholes of gene sequencing detection, and is helpful for providing gene detection information more comprehensively to assist doctors in diagnosis.
The above-mentioned embodiments are merely preferred embodiments of the present invention, and should not be construed as limiting the present invention, and the scope of the present invention should be defined by the claims, and equivalents including technical features of the claims, i.e., equivalent modifications within the scope of the present invention.
Sequence listing
<110> Shenyou genome institute (Nanjing) Co., Ltd
<120> a method for detecting copy number variation of long segments of genes
<150> 2020113518391
<151> 2020-11-26
<160> 129
<170> SIPOSequenceListing 1.0
<210> 1
<211> 17
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 1
gctgaaactt ctcaacc 17
<210> 2
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 2
ctgggagctc ctctcactc 19
<210> 3
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 3
gagttgatca aggaacctg 19
<210> 4
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 4
cacagcagaa acctacaact c 21
<210> 5
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 5
aaaggcatct ggctgcacaa cc 22
<210> 6
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 6
gggtgagagg atagccctga gc 22
<210> 7
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 7
acaagaaagt acgagattta gtc 23
<210> 8
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 8
attcagcatt tttctttctt taat 24
<210> 9
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 9
attagattag ttaaagtgat gtgg 24
<210> 10
<211> 25
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 10
tggagatcaa gaattgttac aaatc 25
<210> 11
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 11
ggatgttctc atttcccatt tctctt 26
<210> 12
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 12
ccctctgaag ataccgttaa taaggc 26
<210> 13
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 13
tgagatgggt agtttctatt ctgaag 26
<210> 14
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 14
cacccttcca tcataagtga c 21
<210> 15
<211> 15
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 15
tcaacattga tggtg 15
<210> 16
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 16
gggaaacaaa ggacaccgtt a 21
<210> 17
<211> 17
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 17
actgagaagc gtgcagc 17
<210> 18
<211> 16
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 18
cgaccttggt ggtttc 16
<210> 19
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 19
cctcaaactt gtcagcagaa 20
<210> 20
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 20
ctccggagtc ctagcccttt c 21
<210> 21
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 21
ggttcactct gtagaagtct tt 22
<210> 22
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 22
ccatgttcta acacagcttc tag 23
<210> 23
<211> 23
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 23
ggagtctgct ccgtttggtt agt 23
<210> 24
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 24
aacacttacc tggaatctgg aatc 24
<210> 25
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 25
ggatggtgaa tgatgaaagc tcct 24
<210> 26
<211> 24
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 26
ggggagatgc tgagtttgtg tgtg 24
<210> 27
<211> 18
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 27
taagaagtac aaaatgtc 18
<210> 28
<211> 17
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 28
cctgtccctt tcccggg 17
<210> 29
<211> 26
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 29
tgttggcatg ttggtgaagg gcccat 26
<210> 30
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 30
cgggtccaac tctctaacct tggaactg 28
<210> 31
<211> 21
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 31
tgataacaaa ggacaccgtt a 21
<210> 32
<211> 22
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 32
gggaggctca acattgatgg tg 22
<210> 33
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 33
ttactacatt tttctacatc ctttgttt 28
<210> 34
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 34
gatgtttgcc ttttgagtat tctttctaca 30
<210> 35
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 35
acgttggatg actctaccag tgccaggagc 30
<210> 36
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 36
acgttggatg ttctttttct ccccccctac 30
<210> 37
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 37
acgttggatg gttgttctag cagtgaagag 30
<210> 38
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 38
ggatggatga agtgacagtt ccagtagtcc 30
<210> 39
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 39
acgttggatg cacgcttttt acctgagtgg 30
<210> 40
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 40
gttggatgca cttgctgagt gtgtttctca 30
<210> 41
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 41
cgttggatga aagagcacgt tcttctgctg 30
<210> 42
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 42
cgttggatgt tctgagctgt gtgctagagg 30
<210> 43
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 43
gttggatgtg gtcagctttc tgtaatcgaa 30
<210> 44
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 44
acgttggatg acggctaatt gtgctcactg 30
<210> 45
<211> 32
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 45
gttggatgag gatctgattc ttctgaagat ac 32
<210> 46
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 46
acgttggatg atgagctcct cttgagatgg 30
<210> 47
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 47
acgttggatg cagagggata ccatgcaaca 30
<210> 48
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 48
acgttggatg cctactgtgg ttgcttccaa 30
<210> 49
<211> 32
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 49
acgttggatg ataattttgt gctcatggca ga 32
<210> 50
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 50
acgttggatg tggagccaca taacacattc 30
<210> 51
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 51
acgttggatg actcttcttg gctccagttg 30
<210> 52
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 52
acgttggatg tacatgcagg caccttacca 30
<210> 53
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 53
acgttggatg taaggtgaag cagcatctgg 30
<210> 54
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 54
acgttggatg tccaaacctg tgtcaagctg 30
<210> 55
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 55
acgttggatg ggtgcattga tggaaggaag 30
<210> 56
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 56
acgttggatg tttatgcagc agatgcaagg 30
<210> 57
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 57
cgttggatgt ttttgcagaa tccaaactga 30
<210> 58
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 58
acgttggatg gctagaggaa aactttgagg 30
<210> 59
<211> 32
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 59
acgttggatg gttctctttg actcacctgc aa 32
<210> 60
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 60
acgttggatg gtcatcccct tctaaatgcc 30
<210> 61
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 61
acgttggatg tttggccaac aatacacacc 30
<210> 62
<211> 33
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 62
acgttggatg gtggttcggc tttcaccagt ctg 33
<210> 63
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 63
acgttggatg ggtcagctct tcccttcatc 30
<210> 64
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 64
acgttggatg ggtgtggtca ctggacttgg g 31
<210> 65
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 65
acgttggatg cctcccacat gtaatgtgtt g 31
<210> 66
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 66
gttggatgtg tgaattttct gagacggatg 30
<210> 67
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 67
acgttggatg tgacgtgtct gctccacttc 30
<210> 68
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 68
ggatgtttgt tttctcattc catttaaagc 30
<210> 69
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 69
cgttggatgt ttttagcaaa agcgtccaga 30
<210> 70
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 70
tggatgtctc ctgaacatct aaaagatgaa 30
<210> 71
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 71
acgttggatg cagagggata ccatgcaaca 30
<210> 72
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 72
cgttggatga aagcagattt ggcagttcaa 30
<210> 73
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 73
gttggatgaa cttctcccat tcctttcaga 30
<210> 74
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 74
acgttggatg ggtagagggc ctgggttaag 30
<210> 75
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 75
acgttggatg gctctttagc ttcttaggac 30
<210> 76
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 76
tggtttttct aatgtgttaa agttcattgg 30
<210> 77
<211> 33
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 77
acgttggatg tttctcagat aactgggccc ctg 33
<210> 78
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 78
acgttggatg ttcccctgtc cctctctctt 30
<210> 79
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 79
cgttggatgt tcaggaggaa aagcacagaa 30
<210> 80
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 80
acgttggatg gctgtaatga gctggcatga 30
<210> 81
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 81
acgttggatg tgtcctggga ttctcttgct 30
<210> 82
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 82
cgttggatgc ctttccactc ctggttcttt 30
<210> 83
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 83
acgttggatg aagtgttgga agcagggaag 30
<210> 84
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 84
acgttggatg gccacagagc aagactccat 30
<210> 85
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 85
acgttggatg tttggccaac aatacacacc 30
<210> 86
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 86
gttggatggc tttcgttttg aaagcagatt 30
<210> 87
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 87
acgttggatg tgactctggg gctctgtctt 30
<210> 88
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 88
acgttggatg tgtgtcctcc ctctctgaca 30
<210> 89
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 89
acgttggatg cattttcctc ccgcaattcc 30
<210> 90
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 90
acgttggatg cagagtggat ggagaacaag g 31
<210> 91
<211> 28
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 91
acgttggatg cctctcaggt tccgcccc 28
<210> 92
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 92
acgttggatg ggttctccca ggctcttacc 30
<210> 93
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 93
cgttggatgc ctgagaccct tacccaattc 30
<210> 94
<211> 30
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 94
acgttggatg ggtcagctct tcccttcatc 30
<210> 95
<211> 33
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 95
acgttggatg gtggttcggc tttcaccagt ctg 33
<210> 96
<211> 34
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 96
acgttggatg agcaacctgt tacatattaa agtt 34
<210> 97
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 97
acgttggatg cctcccacat gtaatgtgtt g 31
<210> 98
<211> 31
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 98
acgttggatg ggtgtggtca ctggacttgg g 31
<210> 99
<211> 32
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 99
acgttggatg atactgagca aaggcaatca ac 32
<210> 100
<211> 287
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 100
ttctttttct ccccccctac cctgctagtc tggagttgat caaggaacct gactccacaa 60
agtgtgacca catattttgc aagtaagttt gaatgtgtta tgtggctcca ttttcaaaac 120
tgaaaaactc atatattcag tattttactc ccacagcacc tccccccaat ttgacccaca 180
gggaccccca tccaggtgca gggtcctcgc ctgtgtacag ggcacacctt tggtcactcc 240
aaattcccag agctcccagg gtccttctca gggtctccac ctggatg 287
<210> 101
<211> 372
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 101
tttgcctttt gagtattctt tctacaaaag gaagtaaatt aaattgttct ttctttcttt 60
ataatttata gattttgcat gctgaaactt ctcaacctga agaaagggcc ttcacagtgt 120
cctttatgta agaatgatat aaccaaaagg tatataattt ggtaatgatg ctaggttgga 180
agcaaccaca gtaggttttc aaaactgaaa aactcatata ttcagtattt tactcccaca 240
gcacctcccc ccaatttgac ccacagggac ccccatccag gtgcagggtc ctcgcctgtg 300
tacagggcac acctttggtc actccaaatt cccagagctc ccagggtcct tctcagggtc 360
tccacctgga tg 372
<210> 102
<211> 302
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 102
cacttgctga gtgtgtttct caaacaattt aatttcagga gcctacaaga aagtacgaga 60
tttagtctac ttgttgaaga gctattgaaa atcatttgtg cttttcagct tgacacaggt 120
ttggattttc aaaactgaaa aactcatata ttcagtattt tactcccaca gcacctcccc 180
ccaatttgac ccacagggac ccccatccag gtgcagggtc ctcgcctgtg tacagggcac 240
acctttggtc actccaaatt cccagagctc ccagggtcct tctcagggtc tccacctgga 300
tg 302
<210> 103
<211> 366
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 103
ctcagtaaat atttctagtt gaatatctgt ttttcaacaa gtacattttt ttaacccttt 60
taattaagaa aacttttatt gatttatttt ttggggggaa attttttagg atctgattct 120
tctgaagata ccgttaataa ggctacttat tgcaggtgag tcaaagagaa cctttgtcta 180
tgaagctggt tttcaaaact gaaaaactca tatattcagt attttactcc cacagcacct 240
ccccccaatt tgacccacag ggacccccat ccaggtgcag ggtcctcgcc tgtgtacagg 300
gcacaccttt ggtcactcca aattcccaga gctcccaggg tccttctcag ggtctccacc 360
tggatg 366
<210> 104
<211> 361
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 104
tggtcagctt tctgtaatcg aaagagctaa aatgtttgat cttggtcatt tgacagttct 60
gcatacatgt aactagtgtt tcttattagg actctgtctt ttccctatag tgtgggagat 120
caagaattgt tacaaatctc ccctcaagga accagggatg aaatcagttt ggattctgca 180
aaaattttca aaactgaaaa actcatatat tcagtatttt actcccacag cacctccccc 240
caatttgacc cacagggacc cccatccagg tgcagggtcc tcgcctgtgt acagggcaca 300
cctttggtca ctccaaattc ccagagctcc cagggtcctt ctcagggtct ccacctggat 360
g 361
<210> 105
<211> 293
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 105
gttgttctag cagtgaagag ataaagaaaa aaaagtacaa ccaaatgcca gtcaggcaca 60
gcagaaacct acaactcttg gaaggtaaag aacctgcaac tggagccaag aagagttttt 120
caaaactgaa aaactcatat attcagtatt ttactcccac agcacctccc cccaatttga 180
cccacaggga cccccatcca ggtgcagggt cctcgcctgt gtacagggca cacctttggt 240
cactccaaat tcccagagct cccagggtcc ttctcagggt ctccacctgg atg 293
<210> 106
<211> 268
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 106
gctagaggaa aactttgagg aacattcaat gtcacctgta agagaaatgg gaaatgagaa 60
cattccaagt acagtgagca caattagccg tttttcaaaa ctgaaaaact catatattca 120
gtattttact cccacagcac ctccccccaa tttgacccac agggaccccc atccaggtgc 180
agggtcctcg cctgtgtaca gggcacacct ttggtcactc caaattccca gagctcccag 240
ggtccttctc agggtctcca cctggatg 268
<210> 107
<211> 282
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 107
taaggtgaag cagcatctgg gtgtgagagt gaaacaagcg tctctgaaga cagctcaggg 60
ctatcctctc agagtgacat tttaaccact caggtaaaaa gcgtgttttc aaaactgaaa 120
aactcatata ttcagtattt tactcccaca gcacctcccc ccaatttgac ccacagggac 180
ccccatccag gtgcagggtc ctcgcctgtg tacagggcac acctttggtc actccaaatt 240
cccagagctc ccagggtcct tctcagggtc tccacctgga tg 282
<210> 108
<211> 279
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 108
gtcatcccct tctaaatgcc catcattaga tgataggtgg tacatgcaca gttgctctgg 60
gagacttcag aatagaaact acccatctca agaggagctc atttttcaaa actgaaaaac 120
tcatatattc agtattttac tcccacagca cctcccccca atttgaccca cagggacccc 180
catccaggtg cagggtcctc gcctgtgtac agggcacacc tttggtcact ccaaattccc 240
agagctccca gggtccttct cagggtctcc acctggatg 279
<210> 109
<211> 367
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 109
ttctgagctg tgtgctagag gtaactcatg ataatggaat atttgattta atttcagatg 60
ctcgtgtaca agtttgccag aaaactccac atcactttaa ctaatctaat tactgaagag 120
actactcatg ttgttatgaa aacaggtata ccaagaacct ttacagaata ccttgcatct 180
gctgcataaa ttttcaaaac tgaaaaactc atatattcag tattttactc ccacagcacc 240
tccccccaat ttgacccaca gggaccccca tccaggtgca gggtcctcgc ctgtgtacag 300
ggcacacctt tggtcactcc aaattcccag agctcccagg gtccttctca gggtctccac 360
ctggatg 367
<210> 110
<211> 357
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 110
aaagagcacg ttcttctgct gtatgtaacc tgtcttttct atgatctctt taggggtgac 60
ccagtcaatt aaagaaagaa aaatgctgaa tgaggtaagt acttgatgtt acaaactaac 120
cagagatatt cattcagtca tatagttaaa aatgtatttg cttccttcca tcaatgcacc 180
ttttcaaaac tgaaaaactc atatattcag tattttactc ccacagcacc tccccccaat 240
ttgacccaca gggaccccca tccaggtgca gggtcctcgc ctgtgtacag ggcacacctt 300
tggtcactcc aaattcccag agctcccagg gtccttctca gggtctccac ctggatg 357
<210> 111
<211> 319
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 111
aagtgacagt tccagtagtc ctactttgac actttgaatg ctctttcctt cctggggatc 60
cagggtgtcc acccaattga ggttgtgcag ccagatgcct ggacagagga caatggcttc 120
catggtaagg tgcctgcatg tattttcaaa actgaaaaac tcatatattc agtattttac 180
tcccacagca cctcccccca atttgaccca cagggacccc catccaggtg cagggtcctc 240
gcctgtgtac agggcacacc tttggtcact ccaaattccc agagctccca gggtccttct 300
cagggtctcc acctggatg 319
<210> 112
<211> 292
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 112
gtggttcggc tttcaccagt ctgtgcgccc tgccatgtgg aagatgatgc tcaacattga 60
tggtgtgtgg ggagagctat ggagccaggg gcaccccaag tccagtgacc acaccttttc 120
aaaactgaaa aactcatata ttcagtattt tactcccaca gcacctcccc ccaatttgac 180
ccacagggac ccccatccag gtgcagggtc ctcgcctgtg tacagggcac acctttggtc 240
actccaaatt cccagagctc ccagggtcct tctcagggtc tccacctgga tg 292
<210> 113
<211> 259
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 113
ggtcagctct tcccttcatc acatacttgg agaacaaagg acaccgttat ccatgctttt 60
tcaacacatt acatgtggga ggttttcaaa actgaaaaac tcatatattc agtattttac 120
tcccacagca cctcccccca atttgaccca cagggacccc catccaggtg cagggtcctc 180
gcctgtgtac agggcacacc tttggtcact ccaaattccc agagctccca gggtccttct 240
cagggtctcc acctggatg 259
<210> 114
<211> 368
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 114
cagagggata ccatgcaaca taacctgata aagctccagc aggaaatggc tgatctagaa 60
gctgtgttag aacagcatgg gagccagcct tctaacagct acccttccat cataagtgac 120
acttctgccc ttgaggacct gcgaaatcca gaacaaagca catcagaaaa aggtgtgtat 180
tgttggccaa attttcaaaa ctgaaaaact catatattca gtattttact cccacagcac 240
ctccccccaa tttgacccac agggaccccc atccaggtgc agggtcctcg cctgtgtaca 300
gggcacacct ttggtcactc caaattccca gagctcccag ggtccttctc agggtctcca 360
cctggatg 368
<210> 115
<211> 366
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 115
tctcctgaac atctaaaaga tgaagtttct atcatccaaa gtatgggcta cagaaaccgt 60
gcctaaagac ttctacagag tgaacccgaa aatccttcct tggtaaaacc atttgttttc 120
ttcttcttct tcttcttctt ttcttttttt tttctttttt ttttttgaga tggagtcttg 180
ctctgtggct tttcaaaact gaaaaactca tatattcagt attttactcc cacagcacct 240
ccccccaatt tgacccacag ggacccccat ccaggtgcag ggtcctcgcc tgtgtacagg 300
gcacaccttt ggtcactcca aattcccaga gctcccaggg tccttctcag ggtctccacc 360
tggatg 366
<210> 116
<211> 360
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 116
ttcaggagga aaagcacaga actggccaac aattgcttga ctgttcttta ccatactgtt 60
tagcaggaaa ccagtctcag tgtccaactc tctaaccttg gaactgagag aactctgagg 120
acaaagcagc ggatacaacc tcaaaagacg tctgtctaca ttgaattggg taagggtctc 180
aggttttcaa aactgaaaaa ctcatatatt cagtatttta ctcccacagc acctcccccc 240
aatttgaccc acagggaccc ccatccaggt gcagggtcct cgcctgtgta cagggcacac 300
ctttggtcac tccaaattcc cagagctccc agggtccttc tcagggtctc cacctggatg 360
<210> 117
<211> 355
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 117
tgtgaatttt ctgagacgga tgtaacaaat actgaacatc atcaacccag taataatgat 60
ttgaacacca ctgagaagcg tgcagcagag aggcatccag aaaagtatca gggtagttct 120
gtttcaaact tgcatgtgga gccatgtggc acaaatactc atgccagctc attacagctt 180
ttcaaaactg aaaaactcat atattcagta ttttactccc acagcacctc cccccaattt 240
gacccacagg gacccccatc caggtgcagg gtcctcgcct gtgtacaggg cacacctttg 300
gtcactccaa attcccagag ctcccagggt ccttctcagg gtctccacct ggatg 355
<210> 118
<211> 371
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 118
aaagcagatt tggcagttca aaagactcct gaaatgataa atcagggtac taaccaaacg 60
gagcagaatg gtcaagtgat gaatattact aatagtggtc atgagaataa aacaaaaggt 120
gattctattc agaatgagaa aaatcctaac ccaatagaat cactcgaaaa agaatctgct 180
ttcaaaacga aagcttttca aaactgaaaa actcatatat tcagtatttt actcccacag 240
cacctccccc caatttgacc cacagggacc cccatccagg tgcagggtcc tcgcctgtgt 300
acagggcaca cctttggtca ctccaaattc ccagagctcc cagggtcctt ctcagggtct 360
ccacctggat g 371
<210> 119
<211> 337
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 119
tttttagcaa aagcgtccag aaaggagagc ttagcaggag tcctagccct ttctcccata 60
cacatttggc tcagggttac cgaagagggg ccaagaaatt agagtcctca gaagagaact 120
tatctagtga ggatgaagag cttccctgct tccaacactt ttttcaaaac tgaaaaactc 180
atatattcag tattttactc ccacagcacc tccccccaat ttgacccaca gggaccccca 240
tccaggtgca gggtcctcgc ctgtgtacag ggcacacctt tggtcactcc aaattcccag 300
agctcccagg gtccttctca gggtctccac ctggatg 337
<210> 120
<211> 368
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 120
cagagggata ccatgcaaca taacctgata aagctccagc aggaaatggc tgatctagaa 60
gctgtgttag aacagcatgg gagccagcct tctaacagct acccttccat cataagtgac 120
acttctgccc ttgaggacct gcgaaatcca gaacaaagca catcagaaaa aggtgtgtat 180
tgttggccaa attttcaaaa ctgaaaaact catatattca gtattttact cccacagcac 240
ctccccccaa tttgacccac agggaccccc atccaggtgc agggtcctcg cctgtgtaca 300
gggcacacct ttggtcactc caaattccca gagctcccag ggtccttctc agggtctcca 360
cctggatg 368
<210> 121
<211> 329
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 121
tttgttttct cattccattt aaagcagtat taacttcaca gaaaagtagt gaatacccta 60
taagccagaa tccagaaggc cattctgctg acaagtttga ggtgtctgca gatagttcta 120
ccagtaaaaa taaagaacca ggagtggaaa ggttttcaaa actgaaaaac tcatatattc 180
agtattttac tcccacagca cctcccccca atttgaccca cagggacccc catccaggtg 240
cagggtcctc gcctgtgtac agggcacacc tttggtcact ccaaattccc agagctccca 300
gggtccttct cagggtctcc acctggatg 329
<210> 122
<211> 284
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 122
aacttctccc attcctttca gagggaaccc cttacctgga atctggaatc tgcctcttct 60
ctgatgaccc tgaatctgat ccttctgaag acagagcccc agagtcattt tcaaaactga 120
aaaactcata tattcagtat tttactccca cagcacctcc ccccaatttg acccacaggg 180
acccccatcc aggtgcaggg tcctcgcctg tgtacagggc acacctttgg tcactccaaa 240
ttcccagagc tcccagggtc cttctcaggg tctccacctg gatg 284
<210> 123
<211> 315
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 123
gctctttagc ttcttaggac agcacttcct gattttgttt tcaacttcta atcctttgag 60
tgtttttcat tctgcagatg ctgagtttgt gtgtgtacgg acactgaaat attttctagg 120
aattgcggga ggaaaatgtt ttcaaaactg aaaaactcat atattcagta ttttactccc 180
acagcacctc cccccaattt gacccacagg gacccccatc caggtgcagg gtcctcgcct 240
gtgtacaggg cacacctttg gtcactccaa attcccagag ctcccagggt ccttctcagg 300
gtctccacct ggatg 315
<210> 124
<211> 333
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 124
tgacgtgtct gctccacttc cattgaagga agcttctctt tctcttatcc tgatgggttg 60
tgtttggttt ctttcagcat gattttgaag tcagaggaga tgtggtcaat ggatgaaacc 120
accaaggtcc aaagcgagca agagaatccc aggacatttt caaaactgaa aaactcatat 180
attcagtatt ttactcccac agcacctccc cccaatttga cccacaggga cccccatcca 240
ggtgcagggt cctcgcctgt gtacagggca cacctttggt cactccaaat tcccagagct 300
cccagggtcc ttctcagggt ctccacctgg atg 333
<210> 125
<211> 286
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 125
ttcccctgtc cctctctctt cctctcttct tccagatctt cagggggcta gaaatctgtt 60
gcaatgggcc cttcaccaac atgcccacag gtaagagcct gggagaacct tttcaaaact 120
gaaaaactca tatattcagt attttactcc cacagcacct ccccccaatt tgacccacag 180
ggacccccat ccaggtgcag ggtcctcgcc tgtgtacagg gcacaccttt ggtcactcca 240
aattcccaga gctcccaggg tccttctcag ggtctccacc tggatg 286
<210> 126
<211> 353
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 126
ggtagagggc ctgggttaag tatgcagatt actgcagtga ttttacatct aaatgtccat 60
tttagatcaa ctggaatgga tggtacagct gtgtggtgct tctgtggtgt aggagctttc 120
atcattcacc cttggcacag taagtattgg gtgccctgtc agagagggag gacacatttt 180
caaaactgaa aaactcatat attcagtatt ttactcccac agcacctccc cccaatttga 240
cccacaggga cccccatcca ggtgcagggt cctcgcctgt gtacagggca cacctttggt 300
cactccaaat tcccagagct cccagggtcc ttctcagggt ctccacctgg atg 353
<210> 127
<211> 292
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 127
gtggttcggc tttcaccagt ctgtgcgccc tgccatgtgg aagatgatgc tcaacattga 60
tggtgtgtgg ggagagctat ggagccaggg gcaccccaag tccagtgacc acaccttttc 120
aaaactgaaa aactcatata ttcagtattt tactcccaca gcacctcccc ccaatttgac 180
ccacagggac ccccatccag gtgcagggtc ctcgcctgtg tacagggcac acctttggtc 240
actccaaatt cccagagctc ccagggtcct tctcagggtc tccacctgga tg 292
<210> 128
<211> 259
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 128
ggtcagctct tcccttcatc acatacttgg agaacaaagg acaccgttat ccatgctttt 60
tcaacacatt acatgtggga ggttttcaaa actgaaaaac tcatatattc agtattttac 120
tcccacagca cctcccccca atttgaccca cagggacccc catccaggtg cagggtcctc 180
gcctgtgtac agggcacacc tttggtcact ccaaattccc agagctccca gggtccttct 240
cagggtctcc acctggatg 259
<210> 129
<211> 262
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 129
agcaacctgt tacatattaa agttttatta tactacattt ttctacatcc tttgttttag 60
ggtgttgatt gcctttgctc agtatttttc aaaactgaaa aactcatata ttcagtattt 120
tactcccaca gcacctcccc ccaatttgac ccacagggac ccccatccag gtgcagggtc 180
ctcgcctgtg tacagggcac acctttggtc actccaaatt cccagagctc ccagggtcct 240
tctcagggtc tccacctgga tg 262