Skip to main content
Search
Main content
bioRxiv : the preprint server for biology
Published

The complete genome of the KOLF2.1J reference iPSC line

Authors

Pilar Alvarez Jerez, Arang Rhie, Juhyun Kim, Prajna Hebbar, Sagorika Nag M Tech, Dmitry Antipov, Sergey Koren, Erika Lara, Alexandra Beilina, Nancy F Hansen, Charles Arber, Jimena Zulueta, Peter Wild Crea, Dhairya Patel, Glenn Hickey, Brian Waltz, Laksh Malik, William C Skarnes, Xylena Reed, Rylee Genner, Kensuke Daida, Caroline B Pantazis, Francis Grenn, Mike A Nalls, Kimberley Billingsley, Valentina Fossati, Selina Wray, Michael Ward, Mina Ryten, Andrew B Singleton, Mark R Cookson, Miten Jain, Benedict Paten, Adam M Phillippy, Cornelis Blauwendraat

Abstract

bioRxiv [Preprint]. 2026 Mar 12:2026.03.06.710144. doi: 10.64898/2026.03.06.710144.

ABSTRACT

While induced pluripotent stem cells (iPSCs) have gained popularity in studying neurodegenerative diseases, the heterogeneity of stem cells used across studies impacts cross-study comparison. The iPSC Neurodegenerative Disease Initiative (iNDI) selected the KOLF2.1J cell line and prioritized its use as a reference standard for studying the effects of pathogenic variants on cell biology due to its stability and neutral neurodegenerative disease genetic risk. This cell line, and its derivatives expressing over 100 variants related to Alzheimer's disease, Parkinson's disease, and other neurological diseases, are available for academic and industry access. Current genomic data analyses are limited by the use of a human reference genome that does not capture the complete genetic background of a given iPSC line. While in the future this issue may be partially mitigated by the creation of a comprehensive human pangenome, previous work has shown that generating custom genomes is of value both to characterize the variation present and to serve as a more appropriate genomic reference. Here, we generated and characterized a custom complete genome assembly from KOLF2.1J. Mapping of sequencing reads to a personalized diploid assembly results in more comprehensive mapping compared to traditional linear references (i.e GRCh38). In addition, we provide a comprehensive custom gene annotation along with isoform expression and differential methylation analyses across multiple cell types. The assembly and all additional data is browsable and publicly available. This resource will enable more accurate investigation of the KOLF2.1J cell line and any genomics data generated compared to using traditional generalized references, while also serving as a foundational approach for establishing custom reference assemblies for other high-value iPSC lines.

PMID:42051295 | PMC:PMC13119330 | DOI:10.64898/2026.03.06.710144

UK DRI Authors

Selina Wray profile

Prof Selina Wray

UK DRI Affiliate Member - UCL

Professor of Molecular Neuroscience, UCL

Prof Selina Wray

Prof Mina Ryten

Centre Director & Research Division Lead

Leveraging brain transcriptomics to understand the pathophysiology of Lewy body disease

Prof Mina Ryten