The 10th UBC Annual Symposium

"Bioinformatics in Utrecht:
Past 10 years and the future"


Monday, October 7, from 8:30h at the Princess Máxima Center for Pediatric Oncology

The 10th UBC Annual Symposium - Bioinformatics in Utrecht: Past 10 years and the future

We are happy to announce that the 10th UBC annual symposium will take place at the Princess Máxima Center (Utrecht Science Park) on Monday, October 7, 2024. The theme of this year’s edition is “Bioinformatics in Utrecht: Past 10 years and the future”. We will reflect on the compliments of the past decade and look forward to the innovations that lie ahead.

At this yearly event, we aim to bring the bioinformatics community working at the Utrecht Science Park together to exchange knowledge and expertise and discuss the latest progress in the field of bioinformatics.

We strongly encourage you to register (for free) and submit an abstract for a poster presentation at the symposium, to promote your bioinformatics research, gain more exposure, and win a cash prize of 150 euros!

Location:
6th Floor of the Princess Máxima Center for Pediatric Oncology 

Heidelberglaan 25
3584 CS Utrecht

Preliminary Programme

08:30 – 09:00 Registration
09:00 – 09:15 Welcome by the UBC Executive Committee and Prof. dr. Isabel Arends (Chair of the Life Sciences Deans)
09:15 – 10:00 Keynote talk by Prof. dr. Berend Snel
10:00 – 10:30 Coffee break and poster presentations
10:30 – 12:00 Pitch talks Sequencing and Bioinformatics
12:00 – 13:30 Walking lunch and poster presentations
13:30 – 14:15 Keynote talk by Prof. dr. Ana Conesa 
14:15 – 15:45 Pitch talks Modelling and Machine Learning
15:45 – 16:15 Break and poster presentations
16:15 – 17:00 Keynote talk by Prof dr. Erik van Nimwegen
17:00 – 18:00 Closing ceremony followed by drinks

Keynote Talks

In this lecture we will discuss two things. First I will shortly reflect on the past ten years in bioinformatics with a focus on the Utrecht Science Park and the UBC. Subsequently we will discuss our recent investigations in utilizing spliceosomal introns to unlock one of the biggest questions in comparative genomics. Spliceosomal introns are a unique feature of eukaryotic genes. Previous studies have established that many introns were present in the protein-coding genes of the last eukaryotic common ancestor (LECA). Intron positions shared between genes that duplicated before LECA could in principle provide insight into the emergence of the first introns. In this study we use ancestral intron position reconstructions in two large sets of duplicated families to systematically identify these ancient paralogous intron positions. We found that 20-35% of introns inferred to have been present in LECA were shared between paralogs. These shared introns, which likely preceded ancient duplications, were wide spread across different functions, with the notable exception of nuclear transport. Since we observed a clear signal of pervasive intron loss prior to LECA, it is likely that substantially more introns were shared at the time of duplication than we can detect in LECA. The large extent of shared introns indicates an early origin of introns during eukaryogenesis and suggests an early origin of a nuclear structure, before most of the other complex eukaryotic features were established.

Prof. dr. Berend Snel

Berend Snel studied biology at the university of Utrecht specialising in Theoretical Biology and Bioinformatics. He did his PhD project on comperative genome analysis and genome evolution in the group of Peer Bork at EMBL (Heidelberg, Germany). Subsequently he first worked as a normal post-doc and then as a VENI post-doc in Martijn Huynen’s Comparative Genomics group at the CMBI / NCMLS in Nijmegen, Holland. In 2006 he moved to Utrecht University as an Associate Professor. In Utrecht he leads the Evolutionary Genomics and Integrative Bioinformatics group as part of Theoretical Biology and Bioinformatics in the Department of Biology. As of June 2014 he was appointed Professor in Bioinformatics and chair of the Executive Committee of the Utrecht Bioinformatics Center. In 2016 Berend Snel was awarded a VICI grant. Berend has co-authored more than 110 peer reviewed papers which have been cited more than 15.000 times resulting in an H-index of 52.

Long-read sequencing (LRS) technologies have firmly established themselves as viable alternatives to short-read methods in various applications. Over the past five years, LRS platforms such as Oxford Nanopore and Pacific Biosciences have significantly improved in accuracy and throughput, becoming realistic alternatives to Illumina RNA-seq for transcriptomics experiments. Our lab’s pioneering work led to the creation of SQANTI, a widely used tool for the quality control and quantification of long-read transcript models. However, as technologies advance, the need for additional algorithms to support transcriptomics applications of LRS has grown. This includes not only accurate transcript identification and quantification but also analysis and correction of technological biases, experimental design considerations for large experiments, benchmarking resources, data visualization, and functional annotation of full-length transcript variants.

I will present the current suite of SQANTI tools, which provide a comprehensive solution to these data processing needs. For benchmarking, SQANTI-SIM is unique for simulating long-read and orthogonal data with precise control over transcript novelty. This enables the assessment of the identification of both annotated and novel transcript models. BUGSI is a set of universal single-isoform genes that can serve as an internal standard to troubleshoot RNA degradation and library preparation errors. The raw data quality in multi-sample experiments is assessed with SQANTI-reads, which identifies outliers, technology biases, and evaluates whether the data meets quality standards for discovery. SQANTI3 evaluates the performance of transcript reconstruction algorithms to accurately identify transcript models from long reads. The Filter, Rescue, and Requant modules curate transcript models to enhance transcriptome quality and quantification. Our methodologies reveal that lrRNA-seq data exhibit quantification biases distinct from those seen in short-read RNA-seq, necessitating specialized normalization methods. I will also discuss alternative approaches to defining joint transcriptomes in multi-sample experiments and their impact on transcript identification. Finally, I will introduce IsoAnnot, now integrated into the SQANTI framework, which distinguishes productive from unproductive transcripts and annotates them with functional labels to explore the biological impact of alternative splicing.

Prof. dr. Ana Conesa

Ana Conesa is a senior scientist in Bioinformatics and Computational Biology and a research professor at the Institute for Integrative Systems Biology (I2SysBio) at the Spanish National Research Council, leading the Genomics of Gene Expression Lab. She is a Professor in Bioinformatics at the University of Florida and co-founded the Biobam Bioinformatics, a spin-off company specialised in user-friendly software of biologists. She holds an Engineering degree from the Polytechnical University of Valencia (1993) and a PhD in Molecular Microbiology from Leiden University in The Netherlands (2001). Ana has built a scientific program in the development of algorithms and bioinformatics solutions for the analysis of Big Genomics Data, particularly gene expression and their impact on the phenotype. She has conceived and lead the creation of over a dozen of bioinformatics tools that apply to all kind of species, model and non-model organisms, plants, microbiomes, and humans. These include Blast2GO, maSigPro, NOISeq, Paintomics, SQANTI and TAPPAS. Her tools have tens of users world-wide and have received over 25,000 scientific citations. Ana has published over 135 scientific papers and lead international EU (STATegra, DEANN) and US (UF-TEDDY) projects.

Single-cell omics methods promise to revolutionize our understanding of the behavior of biological systems, measurement technologies have undergone spectacular improvements over the last decades, and the number of available multi-omics datasets is increasing rapidly.

However, to realize the promise of these multi-omics data requires rigorous computational methods that can unambiguously quantify what the raw measurements tell us about the underlying biophysical states of individual cells. I will argue that, in this regard, the current state of the single-cell omics analysis field unfortunately provides a rather depressing picture. Most tools subject the data to multiple layers of ad hoc and complex transformations and filters, with countless arbitrarily tunable parameters, and return results as abstract quantities that lack physical interpretation or error bars. Moreover, the fact that the number of proposed analysis methods increases almost as fast as the number of published datasets shows that any consensus on how to analyze such data is yet to emerge in this field.

In this talk I will discuss some of the reasons why rigorous analysis of single-cell omics data is so challenging, taking single-cell RNA-seq as an example, and make some proposals for a way forward that is arguably simpler, more transparent, and directly rooted in our understanding of the underlying biophysics and measurement processes.

Prof dr. Erik van Nimwegen

After studying theoretical physics in Amsterdam, Erik van Nimwegen moved to the United States in 1995, performing his PhD studies at the Santa Fe Institute (SFI) in Santa Fe, New Mexico, and receiving his PhD from the Faculty of Biology at Utrecht University in 1999. After a year of post-doc studies at the SFI, he spent three years as a fellow at the Center of Studies in Physics and Biology at the Rockefeller University, in New York. Since 2003 he has been a Professor of Computational Biology at the Biozentrum of the University of Basel, and group leader at the Swiss Institute of Bioinformatics since 2004. His main research topics concern genome evolution and the function and evolution of the regulatory networks by which cells control gene expression. He develops mathematical models for analyzing how regulatory networks evolve and function, and computational methods for the reconstruction of such networks from large biological data-sets. 

Pitch Talks

Title Speaker PI
Multiomic measurements of tRNAs and ribosome extension dynamics in single cells Mees van der Ent (Hubrecht Institute) Prof. dr. ir. Alexander van Oudenaarden
Searching for robust protein biomarkers in human plasma Isabel Houtkamp (UU) Prof. dr. Sanne Abeln
NanoRCS: Multimodal tumor cell-free DNA profiling using nanopore-based consensus sequencing Li-Ting Chen (UMCU) Dr. ir. Jeroen de Ridder
Comparative transcriptomics reveals a conserved immune response in six mushroom-forming fungi during interaction with their competitors Marieke van Maanen (UU) Dr. Robin Ohm
Fungal biocontrol agents for Striga weed eradication Dr. Khyati Mheta Bhatt (Westerdijk Institute) Dr. Jérôme Collemare
Bioinformatic approaches to decipher lettuce innate immunity Iñigo Bañales Belaunde (UU) Prof. dr. Guido Van den Ackerveken
Title Speaker PI

Quantitative modelling of fate specification in the C. elegans postembryonic M lineage reveals a missing spatiotemporal signal

Dr. Benjamin Planterose Jimenez (UU) Prof. dr. Kirsten ten Tusscher and Dr. Erika Tsingos
Decomposing plasmid evolution using a simple pangenomic language model Dr. Weizhen Xu (UMCU) Dr. Anita Schürch
Methodology for Biomarker Discovery with Reproducibility in Microbiome Data using Machine Learning David Rojas-Velazquez (UU) Dr. Alejandro Lopez-Rincon
M&M: An RNA-seq based Pan-Cancer Classifier for Pediatric Tumors
Fleur Wallis (PMC) Dr. Patrick Kemmeren
Exploring the diversity of fungal plant biomass conversion Dr. Jiajia Li (Westerdijk Institute) Prof. dr. ir. Ronald de vries  and  Dr. Mao Peng
RNA splicing as novel immune regulator Dr. Alejandra Bodelón de Frutos (UMCU) Dr. Julia Drylewicz

Registration