Arrow Research search

Author name cluster

Roland Eils

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

4 papers
1 author row

Possible papers

4

NeurIPS Conference 2025 Conference Paper

JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model

  • Qihao Duan
  • Bingding Huang
  • Zhenqiao Song
  • Irina Lehmann
  • Lei Gu
  • Roland Eils
  • Benjamin Wild

Large language models (LLMs) have revolutionized natural language processing and are increasingly applied to other sequential data types, including genetic sequences. However, adapting LLMs to genetics presents significant challenges. Capturing complex genomic interactions requires modeling long-range global dependencies within DNA sequences, where interactions often span over 10, 000 base pairs, even within a single gene. This poses substantial computational demands under conventional model architectures and training paradigms. Additionally, traditional LLM training approaches are suboptimal for DNA sequences: autoregressive training, while efficient for training, only supports unidirectional sequence understanding. However, DNA is inherently bidirectional. For instance, bidirectional promoters regulate gene expression in both directions and govern approximately 11% of human gene expression. Masked language models (MLMs) enable bidirectional understanding. However, they are inefficient since only masked tokens contribute to loss calculations at each training step. To address these limitations, we introduce JanusDNA, the first bidirectional DNA foundation model built upon a novel pretraining paradigm, integrating the optimization efficiency of autoregressive modeling with the bidirectional comprehension capability of masked modeling. JanusDNA's architecture leverages a Mamba-Attention Mixture-of-Experts (MoE) design, combining the global, high-resolution context awareness of attention mechanisms with the efficient sequential representation learning capabilities of Mamba. The MoE layers further enhance the model's capacity through sparse parameter scaling, while maintaining manageable computational costs. Notably, JanusDNA can process up to 1 million base pairs at single-nucleotide resolution on a single 80GB GPU using its hybrid architecture. Extensive experiments and ablation studies demonstrate that JanusDNA achieves new state-of-the-art performance on three genomic representation benchmarks. Remarkably, JanusDNA surpasses models with 250x more activated parameters, underscoring its efficiency and effectiveness. Code available at https: //anonymous. 4open. science/r/JanusDNA/.

NeurIPS Conference 2023 Conference Paper

Differentiable sorting for censored time-to-event data.

  • Andre Vauvelle
  • Benjamin Wild
  • Roland Eils
  • Spiros Denaxas

Survival analysis is a crucial semi-supervised task in machine learning with significant real-world applications, especially in healthcare. The most common approach to survival analysis, Cox’s partial likelihood, can be interpreted as a ranking model optimized on a lower bound of the concordance index. We follow these connections further, with listwise ranking losses that allow for a relaxation of the pairwise independence assumption. Given the inherent transitivity of ranking, we explore differentiable sorting networks as a means to introduce a stronger transitive inductive bias during optimization. Despite their potential, current differentiable sorting methods cannot account for censoring, a crucial aspect of many real-world datasets. We propose a novel method, Diffsurv, to overcome this limitation by extending differentiable sorting methods to handle censored tasks. Diffsurv predicts matrices of possible permutations that accommodate the label uncertainty introduced by censored samples. Our experiments reveal that Diffsurv outperforms established baselines in various simulated and real-world risk prediction scenarios. Furthermore, we demonstrate the algorithmic advantages of Diffsurv by presenting a novel method for top-k risk prediction that surpasses current methods.

YNIMG Journal 2018 Journal Article

Rigid and non-rigid registration of polarized light imaging data for 3D reconstruction of the temporal lobe of the human brain at micrometer resolution

  • Sharib Ali
  • Stefan Wörz
  • Katrin Amunts
  • Roland Eils
  • Markus Axer
  • Karl Rohr

To understand the spatial organization as well as long- and short-range connections of the human brain at microscopic resolution, 3D reconstruction of histological sections is important. We approach this challenge by reconstructing series of unstained histological sections of multi-scale ( 1. 3 μ m and 64 μ m ) and multi-modal 3D polarized light imaging (3D-PLI) data. Since spatial coherence is lost during the sectioning procedure, image registration is the major step in 3D reconstruction. We propose a non-rigid registration method which comprises of a novel multi-modal similarity metric and an improved regularization scheme to cope with deformations inevitably introduced during the sectioning procedure, as well as a rigid registration approach using a robust similarity metric for improved initial alignment. We also introduce a multi-scale feature-based localization and registration approach for mapping of 1. 3 μ m sections to 64 μ m sections and a scale-adaptive method that can handle challenging sections with large semi-global deformations due to tissue splits. We have applied our registration method to 126 consecutive sections of the temporal lobe of the human brain with 64 μ m and 1. 3 μ m resolution. Each step of the registration method was quantitatively evaluated using 10 different sections and manually determined ground truth, and a quantitative comparison with previous methods was performed. Visual assessment of the reconstructed volumes and comparison with reference volumes confirmed the high quality of the registration result.