Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

$\textit{BlockFormer}$ : Transformer-based inference from interaction maps

About

Inference from interaction maps, such as centromere identification from genome-wide chromosome conformation capture techniques -- notably Hi-C -- can be formulated as a generic inverse problem: infer a set of parameters given a map summarizing pairwise interactions between entities through blocks of variable numbers and sizes. In this work, we introduce a data-driven approach that leverages shared structure between these maps, such as global alignment between localized patterns, while handling the variability in number and size of entities arising in real-world data. Our approach relies on a transformer architecture capable of handling such variability and a custom simulator to generate abundant, yet computationally cheap synthetic data for training. Applied to the problem of centromere localization, the method accurately recovers their genomic positions across a wide range of species of various genome sizes.

Elo\"ise Touron, Pedro L. C. Rodrigues, Julyan Arbel, Nelle Varoquaux, Michael Arbel• 2026

Related benchmarks

TaskDatasetResultRank
Centromere identificationSaccharomyces cerevisiae (S.C.)
Error Rate0.002
8
Centromere identificationLachancea kluyveri (L.K.)
Error Rate0.17
8
Centromere identificationLachancea thermotolerans (L.T.)
Error Rate0.28
4
Centromere identificationSaccharomyces mikatae (S.M.)
Error Rate18
4
Centromere identificationK.L. (Kluyveromyces lactis)
Error Rate0.18
4
Centromere identificationS.P. (Schizosaccharomyces pombe)
Error Rate0.94
4
Centromere identificationA.T. (Arabidopsis thaliana)
Error Rate3.7
4
Centromere identificationP.F.r. (Plasmodium falciparum rings stage)
Error Rate0.18
4
Centromere identificationP.F.s. Plasmodium falciparum schizonts stage
Error Rate0.27
4
Centromere identificationP.F.t. Plasmodium falciparum trophozoites stage
Error Rate0.28
4
Showing 10 of 13 rows

Other info

Follow for update