Basecamp Research
@Basecamp_Res
Beyond Known Biology: The world's largest foundational biodiversity database for training AI
🧬 News alert: We’re bringing BaseData out of stealth — the world’s largest and fastest growing biodiscovery dataset, built in collaboration with scientists across 26 countries. 🔍 BaseData adds 9.8 billion newly discovered protein sequences to the known tree of life — expanding…

Basecamp Research is mapping the tree of life: digitizing nature to build the foundational data biology never had. AI models trained on biological data have hit the data wall. To test if scaling laws still apply, we need more biodiversity and full coverage of the design space.…
🍃 🧬 Basecamp Research is unlocking nature’s blueprint to accelerate breakthroughs in drug discovery, R&D, and beyond. With #AI accelerated by Microsoft Azure and NVIDIA, they’re creating one of the world’s largest biological databases—fueling scientific innovation across…
Join experts from @Basecamp_Res, @emblebi and @origlobalcloud in this live panel as they explore how sovereign compute can help build and deploy performant AI for life sciences with full compliance. 📅July 22 | 11AM EDT 📍Zoom Register here 👉 lu.ma/fodehpto
🚀 @Basecamp_Res’ proprietary dataset tops 10 billion novel protein sequences. In partnership with @NVIDIAAI, we're building the next generation of biological foundation models: tools to advance drug development and bioengineering. Read how we got here:…
"One of the crown jewels in the tech bio community in Europe." 👑 That’s how @NVIDIAHealth's Rory Kelleher introduced @Basecamp_Res at #GTCParis x #VivaTech. We’re teaming up with @NVIDIAAI to train a new class of biological foundation models on our 9.8B-sequence BaseData —…

If you are interested in 💻🧬⛰️ we are looking for a bioinformatics intern at @Basecamp_Res - come join us to learn about bioinformatics, metagenomics, and biodiscovery basecamp-research.homerun.co/bioinformatics…
🌍 @Basecamp_Res has announced a series of new global biodiscovery partnerships (Malawi, Hungary, @Scripps_Ocean and @veritree) to accelerate the growth of its 9.8B protein sequence database and AI for drug discovery. Co-founders @glen_gowers & Oliver Vince shared the news…

“One of the most exciting things I’ve seen in a long time,” @genentech's @nc_frey comments to @newscientist on @Basecamp_Res' 9.8B protein sequence database. BaseData expands the tree of life 10x beyond public data — and now we’re using it to train a new class of foundation…

Another great example of the UK's ability to develop unique data to push forward AI for bio infrastructure. Props to @Basecamp_Res for launching the world's largest biodiscovery dataset, with over 10x the number of protein sequences currently publicly available 🧬🚀
A UK biotech firm spent years gathering genetic data that has uncovered 1 million previously unknown microbial species and billions of newly identified genes – but even this trove of data may not be enough to train an AI biologist newscientist.com/article/248432…
.@Basecamp_Res just unveiled BaseData: the world’s most diverse biological dataset, built from over 1 million novel species and 9.8 billion protein sequences. It’s a major leap forward for AI-powered biology—fueling breakthroughs in therapeutics, sustainability, and beyond. More…
🧬 Biopharma’s biggest AI challenge isn’t tech — it’s data. 🌍 99.99% of life on Earth remains unmapped. That’s the bottleneck. At #BIO2025, Basecamp Research CSO John Finn is joining a panel moderated by @NVIDIAAI’s David Ruau (@druau) diving into planet-scale biology: creating…
