Acessibilidade / Reportar erro

Mosquito long non-coding RNAs are enriched with Transposable Elements

Abstract

Long non-coding RNAs (lncRNAs) lack coding capacity and mounting evidence suggests that they have a regulatory role in diverse organisms. Most knowledge about lncRNAs comes from studies on vertebrates, including a structural association between lncRNAs and transposable elements (TEs). TE sequences are genomic parasites found in all branches of life and are particularly active and abundant in insect genomes. Here we investigate the contribution of TEs to lncRNA biogenesis in Aedes albopictus and Culex quinquefasciatus. We found that a large fraction of lncRNA loci co-occurs with TE loci in both species. Around 40% of A. albopictus and 52% of C. quinquefasciatus lncRNAs show some association with TEs. Most of the lncRNA/TE associations are represented by TE-derived sequences that are expressed as one or all exons of lncRNAs, including five lncRNAs that seem to influence immune-related genes involved in antiviral response. The contribution of TEs to lncRNAs also varies among the different types of TEs. The Gypsi superfamily is particularly enriched in lncRNAs sequences. In sum, this study demonstrates that transposable elements substantially contribute to lncRNAs biogenesis in A. albopictus and C. quinquefasciatus and may have an impact on regulatory modulation in these species.

Keywords:
lncRNAs; transposable elements; mosquitoes; Aedes ; Culex

Mosquitoes of the genera Aedes, Anopheles, and Culex are vectors of several human pathogens. Many studies sought to investigate a range of biological phenomena in these species, such as insecticide resistance and pathogen infection, resulting in thousands of RNA sequencing datasets availableFigshare repository for this article, http://doi.org/10.6084/m9.figshare.15060921.
http://doi.org/10.6084/m9.figshare.15060...
. Some recently published articles used dozens of these datasets to characterize long non-coding RNAs (lncRNA) in mosquitoes (Azlan et al., 2021aAzlan A, Halim MA, Mohamad F and Azzam G (2021a) Identification and characterization of long noncoding RNAs and their association with acquisition of blood meal in Culex quinquefasciatus. Insect Sci 28:917-928.,bAzlan A, Obeidat SM, Das KT, Yunus MA and Azzam G (2021b) Genome-wide identification of Aedes albopictus long noncoding RNAs and their association with dengue and Zika virus infection. PLoS Negl Trop Dis 15:e0008351.). Many lncRNAs are derived from genes that resemble protein-coding genes (mRNA producers). A mature lncRNA may be processed like a true protein-coding gene mRNA having a similar half-life, but normally they undergo different mRNA processing steps (Ulitsky and Bartel, 2013Ulitsky I and Bartel DP (2013) lincRNAs: Genomics, evolution, and mechanisms. Cell 154:26-46.). However, unlike protein-coding gene mRNAs, lncRNAs are not capable or have a limited capacity of producing fully functional proteins (Mercer et al., 2009Mercer TR, Dinger ME and Mattick JS (2009) Long non-coding RNAs: Insights into functions. Nat Rev Genet 10:155-159.). lncRNAs may contain spurious (not translated) or short Open Reading Frames (ORFs), that is, ORFs that might potentially give rise to short polypeptides (<100 amino acids), but these are commonly absent from the species proteome, or if detected, no function can be inferred or characterized (Ulitsky and Bartel, 2013Ulitsky I and Bartel DP (2013) lincRNAs: Genomics, evolution, and mechanisms. Cell 154:26-46.). These small ORFs may have no functional consequences, but in some cases, they may evolve to a more complex polypeptide giving origin to new genes (Ruiz-Orera et al., 2020Ruiz-Orera J, Villanueva-Cañas JL and Albà MM (2020) Evolution of new proteins from translated sORFs in long non-coding RNAs. Exp Cell Res 391:111940.). Therefore, lncRNAs are a group of RNA molecules larger than 200 nucleotides, expressed from intergenic, intronic or exonic loci, that lack true coding capacity to produce functional proteins (Ulitsky and Bartel, 2013Ulitsky I and Bartel DP (2013) lincRNAs: Genomics, evolution, and mechanisms. Cell 154:26-46.; Cech and Steitz, 2014Cech TR and Steitz JA (2014) The noncoding RNA revolution-trashing old rules to forge new ones. Cell 157:77-94.; Dhanoa et al., 2018Dhanoa JK, Sethi RS, Verma R, Arora JS and Mukhopadhyay CS (2018) Long non-coding RNA: Its evolutionary relics and biological implications in mammals: A review. J Anim Sci Technol 60:25.).

Different studies have shown that lncRNAs impact the specimen phenotype in several ways, mainly influencing gene expression, but also regulating chromosome architecture, recruitment of chromatin modifiers, and modulating chromosomal interactions (Bonasio and Shiekhattar, 2014Bonasio R and Shiekhattar R (2014) Regulation of transcription by long noncoding RNAs. Annu Rev Genet 48:433-455.; Yao et al., 2019Yao R-W, Wang Y and Chen L-L (2019) Cellular functions of long noncoding RNAs. Nat Cell Biol 21:542-551.). In addition, lncRNA knockdowns or knockout experiments result in morphological and behavioral changes in different organisms (Mattick, 2018Mattick JS (2018) The state of long non-coding RNA biology. Noncoding RNA 4:17.). Although our knowledge about lncRNA has advanced, there are still several unanswered questions. One example is: from which genomic structures/loci does lncRNA emerge? Strikingly, results from vertebrates showed an important role of transposable elements (TEs) copies in the emergence of lncRNAs (Kelley and Rinn, 2012Kelley D and Rinn J (2012) Transposable elements reveal a stem cell-specific class of long noncoding RNAs. Genome Biol 13:R107.; Kapusta et al., 2013Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay L, Bourque G, Yandell M and Feschotte C (2013) Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet 9:e1003470.; Hezroni et al., 2015Hezroni H, Koppstein D, Schwartz MG, Avrutin A, Bartel DP and Ulitsky I (2015) Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep 11:1110-1122.). More recently, studies on plants also show that some intronic lncRNAs overlap with TEs (Pedro et al., 2018Pedro DLF, Lorenzetti APR, Domingues DS and Paschoal AR (2018) PlaNC-TE: A comprehensive knowledgebase of non-coding RNAs and transposable elements in plants. Database (Oxford) 2018:1-7.). Although there are few studies on invertebrates, the TE contribution to lncRNA biogenesis appears to vary widely among insects, from almost no TE sequences overlapping with lncRNA in hymenopterans, to considerable overlap in Drosophila melanogaster (Lopez-Ezquerra et al., 2017Lopez-Ezquerra A, Harrison MC and Bornberg-Bauer E (2017) Comparative analysis of lincRNA in insect species. BMC Evol Biol 17:155.). A recent publication of our group characterized in detail the contribution of TEs to the genome of mosquitoes (Melo and Wallau, 2020Melo ES and Wallau GL (2020) Mosquito genomes are frequently invaded by transposable elements through horizontal transfer. PLoS Genet 16:e1008946.). Here, we leverage such a rich TE dataset to evaluate the TE contribution to lncRNA loci in Aedes albopictus and Culex quinquefasciatus. We detected a broad contribution of some specific TE superfamilies in these two species' lncRNA profiles.

In order to investigate the role of TEs in lncRNA biogenesis in A. albopictus and C. quinquefasciatus, we first updated a previous TE characterization conducted by Melo and Wallau (2020Melo ES and Wallau GL (2020) Mosquito genomes are frequently invaded by transposable elements through horizontal transfer. PLoS Genet 16:e1008946.) using the most up-to-date genome assembly of both species (GCF_006496715.1 and GCF_015732765.1, respectively). Briefly, we used TEdenovo to generate putative TE consensus families; next filters were applied to remove ill assembled consensus (Figure S1 Figure S1 - The workflow diagram of the analysis performed in the study. shows the entire analysis workflow). TE annotation files were generated using the TEannot pipeline (Flutre et al., 2011Flutre T, Duprat E, Feuillet C and Quesneville H (2011) Considering transposable element diversification in de novo annotation approaches. PLoS One 6:e16526.) with custom parameters. We only kept annotated TE copies larger than 200 nucleotides, as this is the minimum size of an lncRNA. In addition, decreasing the TE copy size increases the chance of spurious matches or superfamily misassignments.

We then obtained the available annotation from A. albopictus RefSeq assembly GCF_006496715.1 and combined it with the lncRNA dataset annotated by Azlan et al. (2021bAzlan A, Obeidat SM, Das KT, Yunus MA and Azzam G (2021b) Genome-wide identification of Aedes albopictus long noncoding RNAs and their association with dengue and Zika virus infection. PLoS Negl Trop Dis 15:e0008351.) to generate lncRNA coordinates of this species. The C. quinquefasciatus annotation dataset from RefSeq assembly VPISU_Cqui (GCF_015732765.1) was used as the lncRNA source, we also took the lncRNA dataset annotated by Azlan et al. (2021aAzlan A, Halim MA, Mohamad F and Azzam G (2021a) Identification and characterization of long noncoding RNAs and their association with acquisition of blood meal in Culex quinquefasciatus. Insect Sci 28:917-928.) to compare between the current and previous assembly version (Cpip2). Lastly, we used GffCompare (Pertea and Pertea, 2020Pertea G and Pertea M (2020) GFF utilities: GffRead and GffCompare. F1000Res 9:304.) to find the overlap degree between TEs and lncRNA coordinates in both species.

We found large variability in lncRNA and TE co-occurrence: I - one or multiple TEs can be inserted in introns or exons of a single lncRNA gene; II - TE sequences can overlap intron/exons junctions of a lncRNA gene, or III - a single or multiexon lncRNA can be entirely encompassed by TE coordinates. We found that 37.88% of all A. albopictus lncRNAs genes have some degree of overlap with transposable elements (Figure 1A). While this proportion varies in both versions of C. quinquefasciatus assembly, from 32% (Figure S2 Figure S2 - Cooccurrence of lncRNAs and transposable elements in the previous version of C. quinquefasciatus assembly (CpipJ2). ) in CpipJ2 to 52.24% in VPISU_Cqui (Figure 1B). As the latter was the most recent and showed better contiguity, we followed the analysis focusing on this version. Moreover, it is interesting to note that 10.61% and 17.98% of lncRNAs loci are expressed in the TE neighborhood (maximum distance of 2 kb between a lncRNA locus and a TE copy) in A. albopictus and C. quinquefasciatus respectively. Although not overlapped with a TE region, the expression of these lncRNAs may be modulated by the surrounding TEs transcriptional activity (Lee et al., 2019Lee H, Zhang Z and Krause HM (2019) Long noncoding RNAs and repetitive elements: Junk or intimate evolutionary partners? Trends Genet 35:892-902.).

LncRNAs occurring in introns of coding genes (intronic) and between genes (intergenic) were assessed separately and the difference in TE participation in the biogenesis of these two types of lncRNA was very small. In A. albopictus, 39.34% of intronic lncRNAs and 37.05% of intergenic lncRNAs overlaps with TEs. This variation is smaller in C. quinquefasciatus where 52.86% and 52.36% of intronic and intergenic lncRNAs co-occurs with TEs, respectively. The similarity in the TE/lncRNAs co-occurrence between intronic and intergenic regions differs substantially from vertebrate lncRNAs, where lncRNAs from intergenic and gene-poor regions are richer in TEs than lncRNAs derived from the intronic region and gene-rich part of the genome (Kapusta et al., 2013Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay L, Bourque G, Yandell M and Feschotte C (2013) Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet 9:e1003470.).

We also evaluate if the intersection between lncRNA genes and TEs occurs more frequently in introns or exons of lncRNAs. We observed that 77.56% and 85.17% of the lncRNA genes that show some degree of overlap with TEs have at least one exon involved with a TE insertion in A. albopictus and C. quinquefasciatus respectively. This represents 29.37% and 44.49% of all lncRNA expressed in these species. The high fraction of TEs found in lncRNA exons contrasts with some other insects (Lopez-Ezquerra et al., 2017Lopez-Ezquerra A, Harrison MC and Bornberg-Bauer E (2017) Comparative analysis of lincRNA in insect species. BMC Evol Biol 17:155.). Thus, the depletion of TEs in lncRNAs exons, as suggested by previous authors, can not be extrapolated for insects in general. At least, in insect species with a large genome and high density of TEs, the contribution of TEs to the evolution and properties of lncRNA can be as important as for vertebrates. In lncRNA exons, TEs can be directly responsible for the lncRNA activity domains (Johnson and Guigó, 2014Johnson R and Guigó R (2014) The RIDL hypothesis: Transposable elements as functional domains of long noncoding RNAs. RNA 20:959-976.). Besides, they can also control the expression of lncRNAs via the PIWI-piRNA pathway since the TE regions within lncRNAs may be a target of piRNAs derived from other TE copies dispersed in the genome (Wang and Lin, 2021Wang C and Lin H (2021) Roles of piRNAs in transposon and pseudogene regulation of germline mRNAs and lncRNAs. Genome Biol 22:27.).

Interestingly, a significant fraction of TE copies that overlaps with lncRNA genes is derived from LTR retrotransposons. They represent 82,82% and 31,89% of lncRNA exons in A. albopictus and C. quinquefasciatus species (Figure 1C, D). Among these retrotransposons, the Gypsy superfamily stands out. In A. albopictus, Gypsy superfamily alone is responsible for more than half of all TE-derived exons of lncRNA (Figure 1E, Table S1 Table S1 - The proportion of TE-derived lncRNA exons in both species. ), and almost one-quarter in C. quinquefasciatus (Figure 1F, Table S1 Table S1 - The proportion of TE-derived lncRNA exons in both species. ). Strikingly, the proportion of TE superfamily in lncRNA exons does not represent the proportion that TE superfamilies copies found in the mobilome (Chi-squared test, P-value < 2.2e-16) (Figure S3 Figure S3 - Different distribution of TE superfamilies in genome and in lncRNA exons. ). Performing an over-representation analysis using the clusterProfiler package (Yu et al., 2012Yu G, Wang L-G, Han Y and He Q-Y (2012) clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16:284-287.), version 3.10.1, we observed that some TE superfamilies are enriched in lncRNA exons of both species (Figure 2B, D). The elements from the Gypsy superfamily represent again one of the most interesting cases. This superfamily shows the highest geneRatio and lower p-values in over-representation analysis among all TE superfamilies of both species (MITE-like elements are not a superfamily, instead they are heterogeneous elements usually derived from DNA transposons that lost their coding region and are flanked by terminal inverted repeats - TIRs). Gypsy elements represent 53.95% of lncRNA exons that overlap with TEs, but only 20.37% of all TE copies found on A. albopictus genome (Figure 2A). This discrepancy increases substantially in C. quinquefasciatus where Gypsy represents 22.38% of lncRNA exons, but only 1.69% of TE copies on the genome, a 13-fold difference (Figure 2C). The over-representation of some LTR retrotransposons has been observed in human and mouse, but not in insects, where studies on some species showed that TEs on lncRNA reflect genomic TE landscapes (Lopez-Ezquerra et al., 2017Lopez-Ezquerra A, Harrison MC and Bornberg-Bauer E (2017) Comparative analysis of lincRNA in insect species. BMC Evol Biol 17:155.). The predominance of a specific TE type in lncRNAs also varies between species, LTR is the main TE order in lncRNA of A. albopictus, as in humans, mice, and insects such as D. melanogaster (Kapusta et al., 2013Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay L, Bourque G, Yandell M and Feschotte C (2013) Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet 9:e1003470.; Lopez-Ezquerra, et al., 2017Lopez-Ezquerra A, Harrison MC and Bornberg-Bauer E (2017) Comparative analysis of lincRNA in insect species. BMC Evol Biol 17:155.). On the other hand, DNA transposons are prevalent in lncRNA of C. quinquefasciatus, as in other insects as Tribolium castaneum and Sogatella furcifera (Lopez-Ezquerra et al., 2017Lopez-Ezquerra A, Harrison MC and Bornberg-Bauer E (2017) Comparative analysis of lincRNA in insect species. BMC Evol Biol 17:155.; Chang et al., 2020Chang Z-X, Ajayi OE, Guo D-Y and Wu Q-F (2020) Genome-wide characterization and developmental expression profiling of long non-coding RNAs in Sogatella furcifera. Insect Sci 27:987-997.). This suggests that the predominant TE type in lncRNAs is species or genus-specific and cannot be extrapolated to family or higher taxa levels.

Figure 1 -
Co-occurrence of lncRNAs and transposable elements in Aedes albopictus (A) and Culex quinquefasciatus (B). The overlaps were separated into five types: lncRNA overlap with TEs but in the opposite strand; lncRNA contains one or more TEs inserted in their coordinates; lncRNA presents some partial overlap with TEs, in this case, the intersection occurs in 5’ or 3’ of lncRNA; lncRNAs genes are fully derived from a TE copy insertion, where some represent the total length of the TE and other are formed by a specific region of a TE. The proportion of TE-derived lncRNA exons are shown by TE orders in C and D for A. albopictus and C. quinquefasciatus, respectively. This proportion indicates the prevalence of Gypsy TE in A. albopictus (E) and C. quinquefasciatus (F).

Figure 2 -
Over-representation analysis of TE superfamilies in lncRNA exons. Distribution of superfamilies that are enriched in A. albopictus (A) and C. quinquefasciatus (C), a full list of superfamilies is shown in Figure S3 Figure S3 - Different distribution of TE superfamilies in genome and in lncRNA exons. . The results of the over-representation analysis show that Gypsy and BEL (LTR retroelements) are the most enriched superfamilies in both A. albopictus (B) and C. quinquefasciatus (D). MITE-like elements are not a TE superfamily as they are a heterogeneous group of defective elements that have TIRs.

Azlan et al. (2021bAzlan A, Obeidat SM, Das KT, Yunus MA and Azzam G (2021b) Genome-wide identification of Aedes albopictus long noncoding RNAs and their association with dengue and Zika virus infection. PLoS Negl Trop Dis 15:e0008351.) identified 13 lncRNAs that regulate the expression of immune-related genes involved in arbovirus antiviral defense, here we found that five of these lncRNA genes are composed of TEs sequences. Two lncRNAs have one or more TEs inside their introns (XR_003898565.1, XR_003896231.1). The remaining three are fully comprised into TEs copies boundaries, two of them in the same strand (XR_003894180.1, lncRNA_5135.1) and the other on the opposite strand (XR_002127134.2). These three lncRNAs are derived from LTR retroelements (Gypsy, Bel-Pao, and Copia). Moreover, one out of 13 also presents an LTR in its flanking region. LTR retroelements are known to possess long terminal repeats (LTRs) at the boundaries of their coding region. In addition to complete and fragmented LTR retroelements copies, many solo-LTRs are also found dispersed into the genome. LTRs regions contain RNA polymerase II promoters as well as other regulatory sequence motifs, and several studies have been showing that LTRs are more likely to keep these sites than other TE types, increasing the likelihood of modulating flanking genes expression (Chuong et al., 2017Chuong EB, Elde NC and Feschotte C (2017) Regulatory activities of transposable elements: From conflicts to benefits. Nat Rev Genet 18:71-86.). Furthermore, selective pressure on lncRNAs may facilitate the domestication of LTRs as lncRNA promoters (Thompson et al., 2016Thompson PJ, Macfarlan TS and Lorincz MC (2016) Long terminal repeats: From parasitic elements to building blocks of the transcriptional regulatory repertoire. Mol Cell 62:766-776.).

In order to evaluate if there are any distinct TE age patterns of TE copies co-occurring or not with lncRNA exons, the divergence of each TE copy that overlaps with lncRNA and that do not overlap with lncRNAs was estimated against the TE consensus family using blastn (Figure S1 Figure S1 - The workflow diagram of the analysis performed in the study. ). The divergence of each TE copy to the respective TE family consensus sequence represents the relative age of each copy since its split from the ancestral element-the TE family consensus sequence. That is, younger TE copies had less time to accumulate mutations since its split from the ancestral TE and show a lower divergence to the consensus while older TE copies had more time to accumulate mutations and are characterized by a higher divergence to the consensus. A Wilcoxon Rank Sum test (wilcox.test on R) was used to compare the copy divergence between the two datasets. On average, TE copies on lncRNA exons are most similar to TE consensus sequences (younger copies) than TEs copies that are not co-occurring with lncRNAs (Figure 3) for both species. Around 69% and 60% of TE copies involved with lncRNA exons are younger than TE copies dispersed in the genome of the same family, in A. albopictus (Figure S4 Figure S4 - Paired box plot showing the mean divergence of the copies for each TE family against the TE family consensus for Aedes albopictus. ) and C. quinquefasciatus (Figure S5 Figure S5 - Paired box plot showing the mean divergence of the copies for each TE family against the TE family consensus for Culex quinquefasciatus. ) respectively. Such divergence patterns rise two non-mutually exclusive hypotheses: I - TE copies co-occurring with lncRNA are indeed younger insertions; II - such low divergence is a result of conserved old copies that are being maintained under purifying selection. Ulitsky and Bartel (2013Ulitsky I and Bartel DP (2013) lincRNAs: Genomics, evolution, and mechanisms. Cell 154:26-46.) came to the conclusion that lncRNA exons are generally more conserved than intergenic regions (neutrally evolving), in vertebrates (Ulitsky and Bartel, 2013Ulitsky I and Bartel DP (2013) lincRNAs: Genomics, evolution, and mechanisms. Cell 154:26-46.) giving support for the latter hypothesis. However, at the moment, our results are insufficient to differentiate and test these hypotheses due to the lack of orthologous lncRNA loci between Ae. albopictus and C. quinquefasciatus.

Figure 3 -
Relative age for TE copies inside (blues) and outside (yellow) lncRNA exons of Aedes albopictus (A) and Culex quinquefasciatus (B). Relative age was calculated using the blastn divergence of the copies for each TE family against the TE family consensus. ** means a p-value < 0.01, **** means a p-value < 0.0001.

In summary, our study demonstrates that a significant fraction (~37-52%) of long non-coding RNAs from A. albopictus and C. quinquefasciatus mosquitoes is involved with transposable elements copies. A portion of these TEs may actively contribute to the regulation of the mosquito’s defense against viral pathogens, and probably to other functions in mosquitoes. From this point of view, a fraction of TE copies that gave rise to functional lncRNAs would no longer be considered genomic parasites or selfish elements, but as a host gene that was co-opted for regulatory functions, as shown in previous studies (Chuong et al., 2017Chuong EB, Elde NC and Feschotte C (2017) Regulatory activities of transposable elements: From conflicts to benefits. Nat Rev Genet 18:71-86.). Our study, also shows that there is an over-representation of LTR retrotransposons co-occurring with lncRNAs for both studied species, which differ from previous observations in Anopheles gambiae. Although several TE superfamilies are distributed across species (Petersen et al., 2019Petersen M, Armisén D, Gibbs RA, Hering L, Khila A, Mayer G, Richards S, Niehuis O and Misof B (2019) Diversity and evolution of the transposable element repertoire in arthropods with particular reference to insects. BMC Ecol Evol 19:11.), co-option for regulatory functions as lncRNAs exons appears to be species or genus-specific, which is supported by the fact that the predominant types of TEs in lncRNAs are different among different insects’ species. Finally, the distribution of TEs in different types of lncRNA, as well as their depletion or enrichment in exons of these structures, does not seem to be characteristic of a group of organisms. Following this reasoning, the profile of TE superfamilies present in lncRNAs is substantially different between species and does not seem to depend on the proportion of superfamilies in the mobilome. However, definitive responses to the influence of TE on lncRNAs for the entire Culicidae family of insects will require additional RNA-Seq experiments from various species belonging to different tribes that compose this taxon.

Acknowledgements

We are thankful to the Bioinformatics Core Facility of the Aggeu Magalhães Institute. This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001, and in part by the National Council for Scientific and Technological Development by the productivity research fellowship level 2 for Wallau GL (303902/2019-1) and under the project Universal 2016 (406667/2016-0). Additionally, it was also supported by Oswaldo Cruz Foundation under the project numbers PROEP/IAM (400742/2019-5).

References

  • Azlan A, Halim MA, Mohamad F and Azzam G (2021a) Identification and characterization of long noncoding RNAs and their association with acquisition of blood meal in Culex quinquefasciatus Insect Sci 28:917-928.
  • Azlan A, Obeidat SM, Das KT, Yunus MA and Azzam G (2021b) Genome-wide identification of Aedes albopictus long noncoding RNAs and their association with dengue and Zika virus infection. PLoS Negl Trop Dis 15:e0008351.
  • Bonasio R and Shiekhattar R (2014) Regulation of transcription by long noncoding RNAs. Annu Rev Genet 48:433-455.
  • Cech TR and Steitz JA (2014) The noncoding RNA revolution-trashing old rules to forge new ones. Cell 157:77-94.
  • Chang Z-X, Ajayi OE, Guo D-Y and Wu Q-F (2020) Genome-wide characterization and developmental expression profiling of long non-coding RNAs in Sogatella furcifera Insect Sci 27:987-997.
  • Chuong EB, Elde NC and Feschotte C (2017) Regulatory activities of transposable elements: From conflicts to benefits. Nat Rev Genet 18:71-86.
  • Dhanoa JK, Sethi RS, Verma R, Arora JS and Mukhopadhyay CS (2018) Long non-coding RNA: Its evolutionary relics and biological implications in mammals: A review. J Anim Sci Technol 60:25.
  • Flutre T, Duprat E, Feuillet C and Quesneville H (2011) Considering transposable element diversification in de novo annotation approaches. PLoS One 6:e16526.
  • Hezroni H, Koppstein D, Schwartz MG, Avrutin A, Bartel DP and Ulitsky I (2015) Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep 11:1110-1122.
  • Johnson R and Guigó R (2014) The RIDL hypothesis: Transposable elements as functional domains of long noncoding RNAs. RNA 20:959-976.
  • Kapusta A, Kronenberg Z, Lynch VJ, Zhuo X, Ramsay L, Bourque G, Yandell M and Feschotte C (2013) Transposable elements are major contributors to the origin, diversification, and regulation of vertebrate long noncoding RNAs. PLoS Genet 9:e1003470.
  • Kelley D and Rinn J (2012) Transposable elements reveal a stem cell-specific class of long noncoding RNAs. Genome Biol 13:R107.
  • Lee H, Zhang Z and Krause HM (2019) Long noncoding RNAs and repetitive elements: Junk or intimate evolutionary partners? Trends Genet 35:892-902.
  • Lopez-Ezquerra A, Harrison MC and Bornberg-Bauer E (2017) Comparative analysis of lincRNA in insect species. BMC Evol Biol 17:155.
  • Mattick JS (2018) The state of long non-coding RNA biology. Noncoding RNA 4:17.
  • Melo ES and Wallau GL (2020) Mosquito genomes are frequently invaded by transposable elements through horizontal transfer. PLoS Genet 16:e1008946.
  • Mercer TR, Dinger ME and Mattick JS (2009) Long non-coding RNAs: Insights into functions. Nat Rev Genet 10:155-159.
  • Pedro DLF, Lorenzetti APR, Domingues DS and Paschoal AR (2018) PlaNC-TE: A comprehensive knowledgebase of non-coding RNAs and transposable elements in plants. Database (Oxford) 2018:1-7.
  • Pertea G and Pertea M (2020) GFF utilities: GffRead and GffCompare. F1000Res 9:304.
  • Petersen M, Armisén D, Gibbs RA, Hering L, Khila A, Mayer G, Richards S, Niehuis O and Misof B (2019) Diversity and evolution of the transposable element repertoire in arthropods with particular reference to insects. BMC Ecol Evol 19:11.
  • Ruiz-Orera J, Villanueva-Cañas JL and Albà MM (2020) Evolution of new proteins from translated sORFs in long non-coding RNAs. Exp Cell Res 391:111940.
  • Thompson PJ, Macfarlan TS and Lorincz MC (2016) Long terminal repeats: From parasitic elements to building blocks of the transcriptional regulatory repertoire. Mol Cell 62:766-776.
  • Ulitsky I and Bartel DP (2013) lincRNAs: Genomics, evolution, and mechanisms. Cell 154:26-46.
  • Wang C and Lin H (2021) Roles of piRNAs in transposon and pseudogene regulation of germline mRNAs and lncRNAs. Genome Biol 22:27.
  • Yao R-W, Wang Y and Chen L-L (2019) Cellular functions of long noncoding RNAs. Nat Cell Biol 21:542-551.
  • Yu G, Wang L-G, Han Y and He Q-Y (2012) clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16:284-287.

Internet Resources

Associate Editor:

Klaus Hartfelder

Publication Dates

  • Publication in this collection
    24 Jan 2022
  • Date of issue
    2022

History

  • Received
    27 July 2021
  • Accepted
    29 Nov 2021
Sociedade Brasileira de Genética Rua Cap. Adelmio Norberto da Silva, 736, 14025-670 Ribeirão Preto SP Brazil, Tel.: (55 16) 3911-4130 / Fax.: (55 16) 3621-3552 - Ribeirão Preto - SP - Brazil
E-mail: editor@gmb.org.br