Martina Vilas
@martinagvilas
CS PhD Student working on AI interpretability. Intern at @MSFTResearch AI Frontiers.

My team at Microsoft Research, working in multimodal AI, is hiring! Please apply if you are interested in working at the cutting edge of multimodal generative AI. jobs.careers.microsoft.com/global/en/job/…
Excited to have joined @MSFTResearch in Redmond as a PhD intern with the AI Frontiers Evaluation and Understanding team! This summer I'll be working on the interpretability of AI reasoning models, exploring how we can better understand, assess and control their behavior 💻🧠

I'm very excited that this work was accepted for an oral presentation @naacl! Come by at 10:45 on Thursday to hear how we can use mechanistic interpretability to better understand how LLMs incorporate context when answering questions.
The ability to properly contextualize is a core competency of LLMs, yet even the best models sometimes struggle. In a new preprint, we use #MechanisticInterpretability techniques to propose an explanation for contextualization errors: the LLM Race Conditions Hypothesis. [1/9]
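Purely as an illustration (not the method from the preprint), here is a minimal sketch of what a contextualization error looks like in practice: compare a model's greedy next-token prediction for a question asked with and without an in-context fact that should change the answer. The model choice ("gpt2") and the prompts are placeholder assumptions.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def next_token(prompt: str) -> str:
    """Greedy next-token prediction for a prompt."""
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits[0, -1]  # logits for the final position
    return tok.decode(logits.argmax())

question = "Q: What color is the sky in this story? A: The sky is"
context = "Story: In this world, the sky is green. "

# A contextualization error is when the two predictions agree even though
# the in-context fact contradicts the model's parametric knowledge.
print("without context:", next_token(question))
print("with context:   ", next_token(context + question))
```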
We will be presenting this 💫 spotlight 💫 paper at #ICLR2025. Come say hi or DM me if you're interested in discussing AI #interpretability in Singapore! 📆 Poster Session 4 (#530) 🕰️ Fri 25 Apr. 3:00-5:30 PM 📝 openreview.net/forum?id=QogcG… 📊 iclr.cc/virtual/2025/p…

December 5th, join @mathildepapillo and our ML Theory group as they dive into "Beyond Euclid: An Illustrated Guide to Modern Machine Learning with Geometric, Topological, and Algebraic Structures." 🤔
Happening now! Visit @TiezziMatteo and me at the @eccvconf #hcv workshop if you are interested in visual #attention models and human-inspired #XAI @FAU_MaD_Lab @UniFAU @expertdotai @TU_Muenchen @goetheuni @IITalk @PAVIS_IIT @HelmholtzMunich @BjoernEskofier @gemmarono
Less than three days to go for the eXCV Workshop at #ECCV2024! Join us on Sunday from 14:00-18:00 in Brown 1 to hear about the state of XAI research from an exciting lineup of speakers! @orussakovsky, @vidal_rene, @sunniesuhyoung, @YGandelsman, @zeynepakata @eccvconf (1/4)
Super excited to start this mini cohort on geometric deep learning!!! The question “why use convolutions instead of something else?” is actually one I wondered about myself, and it eventually led me to learn about GDL.
Have you ever wondered, "why use convolutions instead of something else?" 🤔 Starting next week, join @aniervs and @martinagvilas with our Open Science Community’s ML Theory group for a 9-week cohort on Geometric Deep Learning!
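For readers curious about the GDL answer to that question, a minimal sketch: convolution is translation-equivariant, i.e. shifting the input and then convolving gives the same result as convolving and then shifting, and GDL generalizes this symmetry argument to other groups and domains. The 1-D signal and kernel below are arbitrary synthetic data; circular convolution is used so shifts wrap around cleanly.

```python
import numpy as np

rng = np.random.default_rng(0)
signal = rng.normal(size=32)  # a 1-D "image"
kernel = rng.normal(size=5)   # an arbitrary filter

def circ_conv(x, k):
    """Circular (periodic) convolution of signal x with kernel k."""
    n = len(x)
    return np.array([sum(k[j] * x[(i + j) % n] for j in range(len(k)))
                     for i in range(n)])

def shift(x, s):
    """Cyclic translation of x by s positions."""
    return np.roll(x, s)

# Equivariance: convolving a shifted signal equals shifting the
# convolved signal by the same amount.
lhs = circ_conv(shift(signal, 3), kernel)
rhs = shift(circ_conv(signal, kernel), 3)
print(np.allclose(lhs, rhs))  # True
```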
✨Be sure to join us tomorrow! This is going to be really fun :) Our speakers will be pitching ideas from a wide range of ML sub-fields: ML theory, NLP interpretability, ML fairness and ML + Physical Sciences ✨
Tomorrow 9am PST/12pm EST join @martinagvilas @SurbhiGoel_ @AbdelZayed1 and Shruti Mishra with our community-led Research Connections group, where experienced researchers will pitch research ideas and invite collaboration.
Applications for @CohereForAI scholars program close tomorrow. Something special about the program is our commitment that a research scientist or engineer will read every application.
August 24th, @martinagvilas will be giving an exciting presentation with our community-led NLP group on "Probing the representations and capacities of Vision-Language Models." Be sure to check it out! 🤩 Learn more: cohere.com/events/cohere-…
I’ll be presenting this work next week at #ICML👇 Come by or send a DM if you are interested in discussing these topics in Vienna! 📌 Hall C 4-9 #3004 🕜 Tuesday, July 23rd, 11:30 a.m.–1:00 p.m. CEST
New position paper accepted at #ICML2024! 💻🧠 "An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience" We show how many of the problems/discussions in the AI Inner Interpretability field are similar to those in Cognitive Neuroscience (1/2)
This Thursday, check out the presentation on "LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language" with James Requeima! Learn more: cohere.com/events/cohere-… Thank you @aniervs & @martinagvilas for organizing this talk! 👏
Our community-led ML Theory Group is looking forward to hosting @PetarV_93, Research Scientist at @GoogleDeepMind, next week on Thursday, April 25th for a presentation on "Categorical Deep Learning: An Algebraic Theory of Architectures." Learn more: cohere.com/events/c4ai-Pe…
Every time I get *yet another* rejection of my work analyzing existing models/datasets (because it "lacks novelty"), I worry that our obsession with novelty in ML will make us repeat the same mistakes, without ever understanding why.
🚀 Exciting News! 📚 Unveiling our latest paper: "Concept-based Explainable Artificial Intelligence: A Survey" Delve into the world of #Concept-based #XAI with our comprehensive review of concept-based approaches! 🌐 @CiraGabriele @eliana__pastor arxiv.org/abs/2312.12936
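As a taste of the family of methods such a survey covers, here is a hedged sketch of one classic concept-based technique, the Concept Activation Vector (CAV) from TCAV (Kim et al., 2018): fit a linear classifier separating the activations of concept examples from random ones, and take its normal direction as the concept. All activations and the gradient below are synthetic stand-ins; in practice they would come from a hidden layer of the model being explained.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
d = 64  # hidden-layer width (synthetic)

# Stand-ins for hidden activations of "concept" vs. random examples.
concept_acts = rng.normal(loc=1.0, size=(100, d))
random_acts = rng.normal(loc=0.0, size=(100, d))

X = np.vstack([concept_acts, random_acts])
y = np.array([1] * 100 + [0] * 100)

# The CAV is the (normalized) normal vector of a linear separator.
clf = LogisticRegression(max_iter=1000).fit(X, y)
cav = clf.coef_[0] / np.linalg.norm(clf.coef_[0])

# Concept sensitivity of a prediction: the directional derivative of a
# class logit along the CAV (a synthetic gradient stands in here).
grad = rng.normal(size=d)
print(f"concept sensitivity: {grad @ cav:.3f}")
```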