Global analysis of inverted repeat sequences in human gene promoters reveals their non-random distribution and association with specific biological pathways

Časopis: Genomics, 112(4), 2772–2777
Autoři: Brázda V., Bartas M., Lýsek J., Coufal J., Fojta M.
Rok: 2020
ISBN: 1089-8646

Abstrakt

Inverted repeats (IR) play important roles in specific DNA-dependent processes in simple prokaryotes to complex eukaryotes. They are recognized by a variety of proteins including restriction enzymes, helicases and transcription factors. We evaluate the presence and localization of IRs in all validated human promoter sequences within 1000 bp upstream and downstream of the transcription start site (TSS). The occurrence of 7 bp and longer IRs is located non-randomly in promoter regions, with enrichment within 200 bp upstream of the TSS. The highest frequency of IRs is just before TSS for repeats of 8 bp or longer. A comparison of promoters divided according to the occurrence of five individual promoter motifs shows unique location patterns of IRs. Principal component analyses and hierarchical clustering of IRs abundance demonstrated that they are depleted and/or not enriched in the promoters of stably expressed genes, but show significant enrichments for specific dynamically regulated biological pathways.

Full paper