medRxiv: the preprint server for health sciences
Published

Gene-Pseudogene Inversions as a Hidden Source of Missing Heritability

Authors

Ilaria Quartesan, Stefano Facchini, Arianna Manini, Ricardo Parolin Schnekenberg, Chiara Pisciotta, Stefania Magri, Sara Negri, Carlo Gaetano, Adriana Rebelo, Jacquelyn Schatzman Raposo, Radim Mazanec, Riccardo Curro, Natalia Dominik, Stephanie Efthymiou, Matilde Laurà, Tiffany Grider, Shawna Me Feely, Vera Fridman, Alessandro Bertini, Gustavo Maximiano Alves, Lucia Ferullo, Arianna Ghia, Claudio Caccia, Francesca Balistreri, Paola Saveri, Luca Crivellari, Isabella Moroni, Federica Rachele Danti, Tiziana Mongini, Franco Taroni, Michaela Auer-Grumbach, Enrico Bugiardini, James N Sleigh, Arianna Tucci, Henry Houlden, Petra Laššuthová, Pavel Seeman, Anna Basile, Elisa Giorgio, Michael E Shy, Stephan Zuchner, Mary M Reilly, Davide Pareyson, Andrea Cortese

Abstract

medRxiv [Preprint]. 2025 Oct 7:2025.10.01.25336578. doi: 10.1101/2025.10.01.25336578.

Historically defined as non-functional copies of coding genes, pseudogenes are an abundant yet underexplored element of the human genome, despite growing evidence linking them to human disease. In a genome-wide screen, we identified 411 gene-pseudogene pairs located in opposite orientation, an arrangement permissive for inversions; these pairs include 46 genes already associated with human disease. Next, by analysing long-read sequencing (LRS) data from the 1000 Genomes Project, we confirmed that at least 3.6% of healthy individuals carry an inversion involving one of these gene-pseudogene pairs; these inversions had previously gone undetected by short-read sequencing. Most importantly, we identified novel and recurrent inversions between SORD and its pseudogene SORD2P in 13 of 151 patients (9%) affected by SORD-related Charcot-Marie-Tooth (CMT) neuropathy, including 6 of 8 (75%) SORD-CMT cases in which only one pathogenic variant had been identified by short-read sequencing, making the SORD-SORD2P inversion the third most common pathogenic allele causing SORD-CMT. Of interest, gene-pseudogene pairs displaying chromatin contact in Micro-C data, including SORD/SORD2P, were more likely to undergo inversion events. Overall, our results highlight gene-pseudogene inversions as a previously underrecognized type of pathogenic structural variant. Wider use of LRS could reveal their true prevalence and their contribution to the missing heritability of Mendelian diseases.

PMID:41282915 | PMC:PMC12632684 | DOI:10.1101/2025.10.01.25336578

UK DRI Authors

Prof Henry Houlden

UK DRI Affiliate Member

Professor of Neurology, University College London, Department of Neuromuscular Diseases