Jacob Schreiber
@jmschreiber91
Guest Scientist @impvienna, Board of Directors @NumFOCUS, incoming prof @UMassGCB. Prev, @StanfordMed @uwcse. Studying genomics, machine learning, and fruit.
The more papers I read for a review article I'm writing about ML pitfalls in genomics, the more my faith is shaken in the results from papers that apply machine learning to methylation arrays. A salty thread. 1/
In vivo mapping of mutagenesis sensitivity of human enhancers nature.com/articles/s4158…
Super excited to be on the way to #ISMB2025 @ISCB_RegSys! Who else is going?
Our work on "Evaluating the representational power of pre-trained DNA language models for regulatory genomics" led by @AmberZqt with help from @NiraliSomia & @stevenyuyy is finally published in Genome Biology! Check it out! genomebiology.biomedcentral.com/articles/10.11…
Do current genomic language models (pre-trained on whole genomes) learn a foundational understanding of biology in the non-coding region of human genomes? A new evaluation led by @AmberZqt suggests not yet! 1/N paper: biorxiv.org/content/10.110…
Today marks the end of en-JUNE-eering, the month where I focused mostly on the nitty-gritty of improving genomics ML infrastructure. Here are some of the highlights:
Thank you, Google Flights, for ranking this 10-hour layover in Athens first by "convenience" when I was trying to find a Frankfurt -> Vienna flight.

This evaluation of DNA design methods is very well written. If you're interested in the field, you should def take a look. Also, glad to see Ledidi performing so well! biorxiv.org/content/10.110…