Cassava disease-resistant related gene MeAHL17 and application thereof
1. A cassava disease resistance related gene MeAHL17 is characterized in that: the CDS nucleotide sequence of MeAHL17 is shown in SEQ ID No. 1.
2. The MeAHL17 protein encoded by the cassava disease resistance related gene MeAHL17 in claim 1, wherein the amino acid sequence of the protein is shown as SEQ ID No. 2.
3. A promoter of a cassava disease resistance related gene MeAHL17 has a nucleotide sequence shown in SEQ ID No. 3.
4. Use of the promoter of the cassava disease resistance-related gene MeAHL17 as defined in claim 1 or the promoter of the MeAHL17 as defined in claim 3 for preventing and treating bacterial wilt disease of cassava.
5. Use of the promoter of the cassava disease resistance-related gene MeAHL17 as set forth in claim 1 or the promoter of the MeAHL17 as set forth in claim 3 in breeding new plant varieties.
6. Use according to claim 5, characterized in that: the plant is cassava.
Background
Cassava (Manihot esculenta Crantz) is an important economic crop and grain crop in tropical and subtropical regions, and plays a significant role in agriculture and industrial production in China. Bacterial wilt of Cassava (CBB) is one of the very serious diseases in the current cassava production, which can cause the cassava yield to be greatly reduced and seriously threaten the income increase of farmers and the sustainable development of the cassava industry.
Therefore, how to effectively excavate the cassava bacterial wilt resistance candidate genes and carry out detailed functional research on the cassava bacterial wilt resistance candidate genes so as to provide important gene resources for improving the cassava bacterial wilt, and the method is a difficult problem to be solved urgently in the current cassava biological breeding field.
Bacterial blight of cassava is an important agronomic trait. Because of the high heterozygosity of the cassava genome, the construction of a segregation population through parent hybridization and the positioning of the bacterial blight disease-resistant gene of the cassava by adopting a traditional map-based cloning method are limited to a great extent; meanwhile, the research foundation of cassava is poor, a transgenic system is not mature, the excavation of the resistance gene of the bacterial wilt of cassava is greatly limited, and no available gene is applied to the genetic improvement of the resistance of the cassava.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a cassava disease resistance related gene MeAHL17 and application thereof, which can provide gene resources and important reference basis for improving the cassava bacterial wilt and provide basis for gene improvement and biological breeding.
In order to achieve the aim, the invention provides a cassava disease resistance related gene MeAHL17, wherein the CDS nucleotide sequence of MeAHL17 is shown as SEQ ID No. 1.
The amino acid sequence of the MeAHL17 protein coded by the cassava disease resistance related gene MeAHL17 is shown as SEQ ID No. 2.
The invention also provides a promoter of the cassava disease resistance related gene MeAHL17, and the nucleotide sequence of the promoter is shown in SEQ ID No. 3.
The invention also provides application of the promoter of the cassava disease resistance related gene MeAHL17 or MeAHL17 in preventing and treating bacterial wilt of cassava.
The invention also provides application of the promoter of the cassava disease resistance related gene MeAHL17 or MeAHL17 in breeding new plant varieties, preferably, the plant is cassava.
The invention has the beneficial effects that:
(1) the invention accurately identifies the function of MeAHL17 in resisting the bacterial wilt of cassava, provides theoretical basis and key genes for improving the bacterial wilt of cassava, and has important application value;
(2) the invention shows that the A/G allelic variation of the promoter region of MeAHL17 is related to the resistance of cassava bacterial wilt disease. By detecting the genotype, the early prediction and the rapid screening of the bacterial wilt resistance of cassava are facilitated, and the disease-resistant breeding process of the cassava is accelerated.
Drawings
FIG. 1 is a diagram of the gene mapping and functional SNP analysis of MeAHL 17;
in the figure, FIG. 1A is a view showing a bacterial wilt disease GWAS analysis of cassava, and a candidate gene is located in the interval of 11.65-11.70Mb of chromosome 2;
FIG. 1B is a comparative analysis chart of candidate gene transcriptome;
FIG. 1C is the MeAHL17 gene model, with the red dashed line indicating the location of the functional SNP in the promoter region of MeAHL 17;
FIG. 1D is a graph comparing the activity of the A-containing and G-containing MeAHL17 promoter regions (300 bp and 600bp upstream of the transcription start site) in a dual luciferase assay, each sample comprising 8 biological replicates;
FIG. 1E is a graph comparing the resistance of cassava to bacterial wilt based on the AA and AG MeAHL17 alleles;
FIG. 2 is a graph of analysis of the disease resistance function of MeAHL 17;
in the figure, figure 2A is a graph of the difference in the expression levels of MeAHL17 in cassava with AG alleles (Yunnan8 and ZM9781) and AA alleles (ZM95308 and ruishi x3) measured by qRT-PCR for 3 biological replicates per sample, data expressed as mean ± standard deviation,. indicates P <0.01, at 0 and 2 days after Xam infection;
FIG. 2B is a graph showing the difference in expression level of MeAHL17 in cassava leaves transformed with pCAMBIA1304 (vector control, VC1), pCAMBIA1304:: MeAHL17 (overexpression, OE), pTRV (vector control, VC2) or pTRV:: MeAHL17(RNA silencing, RNAi) in the above four varieties (wherein ZM95308 and Ruishi X3 both carry AA alleles; Yunnan8 and ZM9781 both carry AG alleles), respectively; expression was measured by qRT-PCR, 3 biological replicates per sample were assayed, and data are expressed as mean ± standard deviation, P <0.05, P < 0.01;
FIG. 2C is a graph of the number of bacteria in cassava leaves transformed with pCAMBIA1304 (vector control, VC1), pCAMBIA1304:: MeAHL17 (overexpression, OE), pTRV (vector control, VC2) or pTRV:: MeAHL17(RNA silencing, RNAi) at 0 and 6 days after infection with Xam; data are expressed as mean ± standard deviation, P <0.05, P <0.01, and 4 biological replicates were determined for each sample.
Detailed Description
The present invention is described in further detail below with reference to specific examples so as to be understood by those skilled in the art.
Example 1 cassava disease resistance-related Gene MeAHL17 screening
1. Whole genome correlation analysis is carried out based on cassava germplasm natural population, and gene intervals related to cassava bacterial wilt disease are rapidly locked
The cassava germplasm natural population comes from an agricultural delirium cassava germplasm resource garden. 299 parts of cassava material was arbitrarily selected, fresh young leaves were frozen in liquid nitrogen, and genomic DNA was extracted using DNeasy Plant Mini kit (Qiagen, Beijing). Each sample was used to construct a pair-end library with an insert size of 500bp using 5. mu.g of genomic DNA. The 150bp paired end reads for each sample were sequenced using the Illumina X-Ten platform. Sequencing reads for each sample were aligned to the cassava SC205 reference genome using the BWA mem v0.7.17 program. The alignment results were ranked and repeatedly labeled using Samtools v1.9 and Picard v 1.94.
After deleting low-quality reads, the reads on the single-ended and double-ended comparison are used for detecting SNP by adopting a GATK toolkit v3.5 process. The HaplotypeCaller module is used to create an original genotype file containing SNPs and indels, and further filter them using the following parameters: "QUAL <2.0| | | QD <2.0| | | MQ <40.0| FS >60.0| | MQRankSum < -12.5| | ReadPosRankSum < -8.0-clusterSize 2-clusterWindowSize 5" and "QD <2.0| FS >200.0| ReadPosRankSum < -20.0". The identified SNPs and indels were annotated using SnpEff v3.6c software. Low quality SNPs were filtered using "Major Allele Frequency (MAF) >0.05 and missing data < 20%", yielding a total of 1,313,775 high quality SNPs.
The bacterial wilt of cassava germplasm is evaluated according to the technical specification (NY/T3005-2016) of the bacterial wilt of cassava in China. The average of three biological replicates was used to determine the level of resistance. Then, the cassava bacterial wilt is taken as a phenotypic character, and genome-wide association analysis (GWAS) is carried out. The AQ matrix generated by ADMIXTURE v1.3.0 was considered to be a fixed effect, and the relatives (K) matrix was constructed using SPAGeDi v1.3a. The corresponding P value threshold of 0.000001 is set to control the whole genome type I error rate.
The results show that: the interval 11.65-11.70Mb of chromosome 2 was significantly associated with cassava bacterial wilt disease (FIG. 1A).
2. Further screening for candidate genes based on comparative transcriptome analysis
Transcriptome data were obtained from the NCBI-SRA database under accession number SRP 045199. After removal of the linker sequence and low quality reads, clean reads were aligned to the SC205 reference genome by HISAT2 v 2.0.4. Gene expression levels were calculated by StringTie v1.3.4d using default parameters. Gene expression levels were normalized using FPKM. The differentially expressed genes were identified using DESeq2 software. Differentially expressed genes are defined as FDR <0.01 and fold change > 2.
Based on the cassava genome annotation, there are 6 candidate genes in the 11.65-11.70Mb interval on chromosome 2. Interestingly, only Sc02g014000 expression was significantly induced by the pathogen under the cassava bacterial wilt disease treatment conditions (fig. 1B).
Therefore, it was concluded that Sc02g014000 is an important candidate gene for regulating bacterial blight of cassava, which encodes a nuclear localization protein containing an AT-hook domain, which was designated MeAHL17 based on gene homology, and the CDS nucleotide sequence of MeAHL17 is shown in SEQ ID No. 1; the amino acid sequence is shown as SEQ ID No. 2; the nucleotide sequence of the promoter of MeAHL17 is shown in SEQ ID No. 3.
Example 2
1. Based on comparative genome analysis, SNP (single nucleotide polymorphism) sites significantly related to gene expression are excavated
Based on genome re-sequencing data, one SNP variation was found in the MeAHL17 promoter region (at-53 bp upstream of the start codon, i.e., 948 site of the nucleotide sequence of the promoter of MeAHL 17) (A/G, FIG. 1C).
To explore the effect of a/G allelic variation on the activity of the MeAHL17 promoter, we performed a dual luciferase assay in cassava protoplasts. MeAHL17 promoter sequences (-1 to-300 bp and-1 to-600 bp) with A or G were cloned and inserted into pGreenII0800-LUC vector and transformed into cassava protoplasts. Relative luciferase activity was determined using a dual luciferase reporter assay kit (RG027, Beyotime, shanghai). The results show that: the G-containing MeAHL17 promoter was more active than the a-containing MeAHL17 promoter (fig. 1D), indicating that a/G allelic variation affects expression of MeAHL 17.
2. A large amount of cassava germplasm analysis finds that A/G variation is obviously associated with cassava bacterial wilt resistance
By screening a large number of cassava germplasm we found that the germplasm carrying the AA allele by MeAHL17 was more sensitive to bacterial blight of cassava than the germplasm carrying the AG allele (fig. 1E). It is deduced from this that allelic variation of A/G in the promoter region regulates the expression of MeAHL17, which in turn influences the resistance of bacterial blight of cassava.
Example 3 overexpression and validation of RNAi function indicate that MeAHL17 plays an important role in the resistance of bacterial blight of cassava
To examine the expression profile of MeAHL17 in response to bacterial wilt (Xam), we collected 4 representative cassava varieties (of which ZM95308 and Ruishi X3 carry AA alleles; Yunnan8 and Yunnan 3) at 0 and 2 days after Xam infestationZM9781 carries an AG allele). Total RNA was isolated using cDNA synthesis kit (K1622, USA) and reverse transcribed. Then, using MeEF1a as internal reference and 2-ΔΔCtThe method identified the relative expression level of MeAHL17 by quantitative real-time PCR (qRT-PCR).
The results show that: the expression level of MeAHL17 carrying the AA allele induced less after Xam treatment compared to the AG allele (fig. 2A).
Subsequently, overexpression and RNAi functional verification were performed simultaneously in the above 4 cassava varieties. To overexpress MeAHL17, the coding sequence of MeAHL17 was first amplified and inserted into the pCAMBIA1304 vector, followed by injection of the agrobacterium strain (GV3101) with recombinant vector or pCAMBIA1304 (control) into cassava leaves. To silence MeAHL17, a MeAHL17 specific region was amplified and cloned into pTRV2 vector, followed by injection of agrobacterium strain (GV3101) with recombinant vector or pTRV2 (control) into cassava leaves along with pTRV 1. After two days of cultivation of the plants transformed with pCAMBIA1304:: MeAHL17 and pCAMBIA1304 or two weeks of cultivation of the plants transformed with pTRV:: MeAHL17 and pTRV, Xam was inoculated into the cassava leaves, and the number of bacteria and the gene expression level were checked after 6 days.
The results show that: xam the number of bacterial populations in the different varieties after overexpression of MeAHL17 was significantly reduced compared to the control, whereas the number of bacterial populations in the MeAHL17 silenced varieties (Yunnan8 and ZM9781) was significantly increased compared to the control (FIGS. 2B-C). These results indicate that MeAHL17 plays an important role in combating bacterial blight of cassava, and that allelic variation of A/G in its promoter region is associated with resistance to bacterial blight.
Other parts not described in detail are prior art. Although the present invention has been described in detail with reference to the above embodiments, it is only a part of the embodiments of the present invention, not all of the embodiments, and other embodiments can be obtained without inventive step according to the embodiments, and the embodiments are within the scope of the present invention.
Sequence listing
<110> research institute of tropical biotechnology of Chinese tropical academy of agricultural sciences
Sanya Research Institute, Chinese Academy of Tropical Agricultural Sciences
<120> cassava disease-resistant related gene MeAHL17 and application thereof
<160> 3
<170> SIPOSequenceListing 1.0
<210> 1
<211> 939
<212> DNA
<213> cassava (Manihot esculenta)
<400> 1
atgaaaggtg aatatgtaga ggcacaccat ccaccaaagc atgaaaacgt cacccctatg 60
aacatgttct ctaaacttca tccccatccc catcaccagc tccctttctc tcagcacttc 120
caactctctc gtgaatctga agatgatgaa actagaagca ccggcgctgc cgccgtaacc 180
accccttccc ctaacaccaa tcccgccacc accacaacac caagccagaa gcagaaaccc 240
accgaaccca atagcagtgc tggcactgat ggtgccagta ttgaagttgt tcgcaggccc 300
aggggtagac cgcctggttc caagaataga cctaaaccac ctgtggtcat tacacgagac 360
cctgaaccag ctatgagtcc ttacattctc gaagtccctg gtggaagcga cgtcgttgaa 420
tcgatatccc gcttttgtcg ccgcaagaac atcggaatct gcgttcttac aggatctggc 480
gccgttacta atgttactct tcgtcagcca tcaacgactc ccgggtctac cattactttc 540
catggaagct tcgatattct atcgctttct gctacgttta tgcctcaacc ggtgtcgcat 600
ccagtgccca acacttttac tatctctctg gcgggtccgc agggccagat cgtgggtggg 660
tttgttgctg gaagcttggt agcagctggt accgtgtatg tcatcgcggc gacgtttaat 720
aaccctagtt atcaccggtt accgggtgaa gatgaaggga ggaactctgg gtctggcggc 780
gagggtcagt ctccgtctgt gtccggtgcc ggtggaggcg gaggagatag tggacatacg 840
caaggtggag gggagtcgtg cgggatggtt atgtacagtt gtcatttgcc ctctgatgta 900
atctgggcac ccactgctag gccaccaccg ccctactga 939
<210> 2
<211> 312
<212> PRT
<213> cassava (Manihot esculenta)
<400> 2
Met Lys Gly Glu Tyr Val Glu Ala His His Pro Pro Lys His Glu Asn
1 5 10 15
Val Thr Pro Met Asn Met Phe Ser Lys Leu His Pro His Pro His His
20 25 30
Gln Leu Pro Phe Ser Gln His Phe Gln Leu Ser Arg Glu Ser Glu Asp
35 40 45
Asp Glu Thr Arg Ser Thr Gly Ala Ala Ala Val Thr Thr Pro Ser Pro
50 55 60
Asn Thr Asn Pro Ala Thr Thr Thr Thr Pro Ser Gln Lys Gln Lys Pro
65 70 75 80
Thr Glu Pro Asn Ser Ser Ala Gly Thr Asp Gly Ala Ser Ile Glu Val
85 90 95
Val Arg Arg Pro Arg Gly Arg Pro Pro Gly Ser Lys Asn Arg Pro Lys
100 105 110
Pro Pro Val Val Ile Thr Arg Asp Pro Glu Pro Ala Met Ser Pro Tyr
115 120 125
Ile Leu Glu Val Pro Gly Gly Ser Asp Val Val Glu Ser Ile Ser Arg
130 135 140
Phe Cys Arg Arg Lys Asn Ile Gly Ile Cys Val Leu Thr Gly Ser Gly
145 150 155 160
Ala Val Thr Asn Val Thr Leu Arg Gln Pro Ser Thr Thr Pro Gly Ser
165 170 175
Thr Ile Thr Phe His Gly Ser Phe Asp Ile Leu Ser Leu Ser Ala Thr
180 185 190
Phe Met Pro Gln Pro Val Ser His Pro Val Pro Asn Thr Phe Thr Ile
195 200 205
Ser Leu Ala Gly Pro Gln Gly Gln Ile Val Gly Gly Phe Val Ala Gly
210 215 220
Ser Leu Val Ala Ala Gly Thr Val Tyr Val Ile Ala Ala Thr Phe Asn
225 230 235 240
Asn Pro Ser Tyr His Arg Leu Pro Gly Glu Asp Glu Gly Arg Asn Ser
245 250 255
Gly Ser Gly Gly Glu Gly Gln Ser Pro Ser Val Ser Gly Ala Gly Gly
260 265 270
Gly Gly Gly Asp Ser Gly His Thr Gln Gly Gly Gly Glu Ser Cys Gly
275 280 285
Met Val Met Tyr Ser Cys His Leu Pro Ser Asp Val Ile Trp Ala Pro
290 295 300
Thr Ala Arg Pro Pro Pro Pro Tyr
305 310
<210> 3
<211> 1000
<212> DNA
<213> cassava (Manihot esculenta)
<400> 3
tatctcttat attaattctt ttgatataaa tttatatgtt tattttaatt ttattaattt 60
tcttgatata aatttgtatg tttattttaa ttttaacttt ttatcgtaca tccaaagctt 120
gaatatgttt ttctatgaat tgtactgttt tttctcattc tatgaaattt cctttatttt 180
tagggaaaaa aaaaaaagag cctgagtgtt tttgaatgtg ttttcatttt caatatacga 240
aatgggtgtt caaggctggc tttctctcta gagactacta gaaatggaaa atttgaaaag 300
ccatgatgat ggaatagtga tagagagaga aattgaaaat tagagagaag atgcataaag 360
agcattcccc cataacccga atgataaaac atcaacttgt cattgtcaat ccactcggtc 420
ggggatgctc taaaggactc ttcacatttg ctttctcttc ttactctcct ttaaatttca 480
ataattcttc agagaaaata aaaaaaaaaa aaaaaccaga tcatctgttt ctaatcaatc 540
tttttataat ctactaataa aaccaaaatc ctctaaggcc aaagggggta agtgctttga 600
gttagctaac caaatctgtc tcttcgtgga ccctaccctt caacagttct accttcccaa 660
ttccagcggc gctattagtc tttacttggt ttccaagttc acccaccttc tctttaactc 720
ttctcccact tcacatttac aattacaccc acctcttcct ctccttaaat tctttaaaca 780
tatctaagaa ccatccttct cttttcttcc tcgttgccca tcctcatcct cttccatatc 840
tatccatctt tgtaatttat cttcttcttc ttcttcttct tcttcttctt ccattttttt 900
attatttatt attgatcttg tccgaccaag caatttgcaa atatcctrcc aattataagc 960
catacttgcc aaaaatagag attataaaat aacgcaaaac 1000