Shiyu Ni@ACL 2025
@Shictyu
✈️ Vienna | Ph.D. candidate at the Institute of Computing Technology, Chinese Academy of Sciences | NLP; IR
😎 Our paper “Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective” is accepted to #acl2025 with @bikeping et al. We propose a psychometric-inspired framework to induce and evaluate implicit bias in LLMs. Project webpage: yuchenwen1.github.io/ImplicitBiasEv…
🚨 New release: MegaScience
The largest & highest-quality post-training dataset for scientific reasoning is now open-sourced (1.25M QA pairs)!
📈 Trained models outperform official Instruct baselines
🔬 Covers 7+ disciplines with university-level, textbook-grade QA
📄 Paper:…
The official website for SIGIR-AP 2025 is now live! Please visit: sigir-ap.org/sigir-ap-2025. This year, we are also inviting industry papers. We also encourage authors of unsuccessful SIGIR submissions to consider submitting to SIGIR-AP. We look forward to seeing you in Xi'an!
🚀 It's exciting to see how recent advancements like OpenAI’s o1/o3 & DeepSeek’s R1 are pushing the boundaries! Check out our latest survey on Complex Reasoning with LLMs. We analyzed over 300 papers to chart the progress. Paper: arxiv.org/pdf/2502.17419 Github: github.com/zzli2022/Aweso…
💡 Check out MAIR at #EMNLP2024, a large-scale IR benchmark! Highlights:
- Task Diversity: 126 realistic tasks, 8× more than BEIR 📈
- Domain Coverage: 6 domains and heterogeneous sources 📚
- Instruction Following: 805 relevance criteria
- Lightweight & Fast: optimized data sampling ⚡️
Why does GPT-4o Mini outperform Claude 3.5 Sonnet on LMSys? Formatting is important. Half a year ago, we started studying how response formats impact AI. We found that merely changing the response format boosts both readability and mathematical reasoning.
The Prompt Report: A Systematic Survey of Prompting Techniques. Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of…
Teach LLMs to express truthful confidence levels in “long-form generations”!
When LLMs are unsure, they either hallucinate or abstain. Ideally, they should clearly express truthful confidence levels. Our #ICML2024 work designs an alignment objective to achieve this notion of linguistic calibration in *long-form generations*. arxiv.org/abs/2404.00474 🧵