Hayley Ross
@HayleyRossLing
PhD Student at @Harvard, working on computational semantics and LLM interpretability
The tool-calling decision-making dataset I developed during my internship at NVIDIA is now out! Catch me presenting it at NAACL with @ameyasm1154 or see the 🧵 for details
Check out When2Call, a new dataset for training and evaluating LLMs on decision making about "when (not) to call" functions!
📄 Paper: aclanthology.org/2025.naacl-lon…
🤗 HF Dataset Hub: huggingface.co/datasets/nvidi…
💾 GitHub: github.com/nvidia/WHen2Ca…
#NAACL2025
New preprint with @najoungkim & @TeaAnd_OrCoffee on fake reefs and other cases of (novel) adjective-noun composition: lingbuzz.net/lingbuzz/008012 Whether a fake N is an N or not depends on noun + context, but people handle novel AN pairs just fine 🙂 Stay tuned for results on LLMs!
