Tim Xiao
@TimZXiao
PhD student in Machine Learning @ University of Tübingen · IMPRS-IS scholar
✨ New paper: Flipping Against All Odds. We found that large language models (LLMs) can describe probabilities—but fail to sample from them faithfully. Yes, even flipping a fair coin is hard. 🪙 🧵 Here’s what we learned—and how we fixed it. 🔗arxiv.org/abs/2506.09998 1/
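A rough way to check this yourself (not the paper's protocol; `ask_llm` is a placeholder for whatever chat-completion client you use): ask the model to flip a fair coin many times in fresh contexts and compare the empirical frequency of heads to 0.5.

```python
# Minimal sketch: measure how faithfully an LLM samples a fair coin.
# `ask_llm` is a hypothetical single-turn chat call; plug in your own client.
from collections import Counter

def ask_llm(prompt: str) -> str:
    raise NotImplementedError("wire up your chat-completion client here")

PROMPT = "Flip a fair coin. Reply with exactly one word: Heads or Tails."

def coin_flip_bias(n_trials: int = 200) -> float:
    counts = Counter()
    for _ in range(n_trials):
        reply = ask_llm(PROMPT).lower()
        if "head" in reply:
            counts["heads"] += 1
        elif "tail" in reply:
            counts["tails"] += 1
    total = counts["heads"] + counts["tails"]
    return counts["heads"] / max(total, 1)  # a faithful sampler should be close to 0.5
```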

I was surprised when I first saw that the black magic of prompt engineering can marry classical ML methods in such a natural way: simply asking an LLM to do rejection sampling makes it a more rational agent. Can't wait to see how we may similarly design better "LLM algorithms".
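Here is how I read "asking an LLM to do rejection sampling", as a sketch under that assumption rather than the paper's exact procedure: draw a candidate from a proposal you can sample exactly, then let the LLM play the accept/reject step against a verbally described target.

```python
# Sketch of verbalized rejection sampling as described in the thread above.
# `ask_llm` is a placeholder for a chat-completion client; the prompt wording
# is illustrative, not the one used in the paper.
import random

def ask_llm(prompt: str) -> str:
    raise NotImplementedError("wire up your chat-completion client here")

TARGET = "a Bernoulli distribution with P(heads) = 0.3"

def verbalized_rejection_sample(max_tries: int = 50) -> str:
    candidate = "heads"
    for _ in range(max_tries):
        candidate = random.choice(["heads", "tails"])  # uniform proposal q(x)
        prompt = (
            f"Target distribution: {TARGET}. Proposal: uniform over heads/tails. "
            f"Proposed sample: {candidate}. Acting as the accept/reject step of "
            "rejection sampling, answer with exactly one word: ACCEPT or REJECT."
        )
        if "accept" in ask_llm(prompt).lower():
            return candidate
    return candidate  # fall back to the last candidate if nothing was accepted
```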
Verbalized machine learning treats LLMs with prompts as function approximators. Building on this, @TimZXiao came up with the idea of studying whether LLMs can act as samplers. It turns out they’re often biased, even when they appear to understand the target distribution.
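One simple way to make "biased even when they appear to understand the target distribution" concrete is to compare the distribution the model verbalizes with the empirical histogram of its samples, e.g. via total variation distance. The numbers below are made up for illustration, not results from the paper.

```python
# Total variation distance between a described and an empirically sampled
# distribution over the same discrete support. Example values are illustrative.
def total_variation(p: dict[str, float], q: dict[str, float]) -> float:
    support = set(p) | set(q)
    return 0.5 * sum(abs(p.get(x, 0.0) - q.get(x, 0.0)) for x in support)

described = {"heads": 0.5, "tails": 0.5}    # what the model says the probabilities are
sampled = {"heads": 0.73, "tails": 0.27}    # made-up empirical sampling frequencies
print(total_variation(described, sampled))  # 0.23 here; 0.0 would be a faithful sampler
```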
Great paper by my students @TimZXiao and @johanneszenn and collaborators that applies ideas from Monte Carlo sampling to (black-box) LLM execution to turn LLMs into better calibrated stochastic samplers.
Try it out!
🚀 Meet OFTv2 — Orthogonal Finetuning made scalable, finally. ⚡️ 10× faster 💾 3× less GPU memory 🤖 Quantized OFT: plug-and-play on quantized LLMs, better than QLoRA. Try it now on Hugging Face PEFT: tinyurl.com/ycxswfe7 Website: spherelab.ai/oftv2/ #AI #LLM 🧵1/6
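Trying it from PEFT presumably follows the usual config-plus-`get_peft_model` pattern; the sketch below uses PEFT's `OFTConfig`, but the model name and hyperparameter values are illustrative, and OFTv2-specific options may differ, so check the linked docs.

```python
# Rough sketch of the standard PEFT workflow applied to OFT. Model name and
# hyperparameter values are illustrative; see the linked PEFT docs for OFTv2.
from transformers import AutoModelForCausalLM
from peft import OFTConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
config = OFTConfig(
    r=8,                                  # number of OFT blocks per layer (illustrative)
    target_modules=["q_proj", "v_proj"],  # which projections get orthogonal updates
)
model = get_peft_model(base, config)
model.print_trainable_parameters()        # only the orthogonal factors are trainable
```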
Our @ICCVConference HANDS workshop will take place on the afternoon of Oct. 20! We focus on hand-related areas, e.g., hand pose estimation, hand-object interaction, and robotic hand manipulation. hands-workshop.org @NUSingapore @CSatETH @unibirmingham @RealityLabs @AIatMeta @UTokyo_News @meshcapade
We have added new experiments and analyses in the updated version of our paper. Check it out here: arxiv.org/abs/2506.08001. We discovered that even when generalized to spectrum-preserving training, POET can still preserve minimum hyperspherical energy. This property only…
📢Glad to introduce our paper: Reparameterized LLM Training via Orthogonal Equivalence Transformation (POET)! POET is a new algorithm for efficiently pretraining / finetuning large language models. Its training consists of three geometric phases. spherelab.ai/poet 1/6
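A minimal sketch of the reparameterization as I read it from the abstract (not the released code): keep the initialized weight fixed and train two orthogonal factors around it, which leaves the singular-value spectrum of the original weight untouched.

```python
# Toy sketch of an orthogonal-equivalence reparameterization in the spirit of
# POET (not the official implementation): W = R @ W0 @ P with W0 frozen and
# R, P kept orthogonal via the matrix exponential of skew-symmetric parameters.
import torch
import torch.nn as nn

class OrthogonalEquivalenceLinear(nn.Module):
    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.register_buffer("W0", torch.randn(out_features, in_features) / in_features ** 0.5)
        self.A = nn.Parameter(torch.zeros(out_features, out_features))  # generates R
        self.B = nn.Parameter(torch.zeros(in_features, in_features))    # generates P

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        R = torch.matrix_exp(self.A - self.A.T)  # orthogonal left factor
        P = torch.matrix_exp(self.B - self.B.T)  # orthogonal right factor
        W = R @ self.W0 @ P                      # same singular values as W0
        return x @ W.T
```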
Muon is gaining attention for its use of orthogonalization, making it a natural point of comparison with POET. We computed singular value entropy over training steps and found that POET always maintains high entropy. A recent study (arxiv.org/abs/2502.16982) suggests that this is a…
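For reference, the quantity in question can be computed as the Shannon entropy of the normalized singular values of a weight matrix (the exact normalization used in the paper may differ):

```python
# Singular value entropy: Shannon entropy of the normalized singular values.
# A high value means the spectrum stays spread out rather than collapsing.
import torch

def singular_value_entropy(W: torch.Tensor, eps: float = 1e-12) -> torch.Tensor:
    s = torch.linalg.svdvals(W)
    p = s / (s.sum() + eps)  # treat the singular values as a distribution
    return -(p * torch.log(p + eps)).sum()
```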
Check out our recent work on efficient pretraining for LLMs!
📣 Excited to share our #CVPR2025 Spotlight paper and my internship project @wayve_ai: SimLingo. A Vision-Language-Action (VLA) model that achieves state-of-the-art driving performance with language capabilities. Code: github.com/RenzKa/simlingo Paper: arxiv.org/abs/2503.09594
📢Glad to introduce FormalMATH, a large-scale Lean4 benchmark comprising 5,560 formally verified problems. 📖The benchmark spans from high-school Olympiad challenges to undergraduate-level theorems across diverse domains. The best LLM prover only achieved 16.46% accuracy. 1/4
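For readers unfamiliar with the format: each benchmark item is a Lean 4 statement, and the prover must replace `sorry` with a proof the Lean kernel accepts. The toy statement below only illustrates that shape; it is not an entry from FormalMATH.

```lean
import Mathlib

-- Toy illustration of the problem format (not an actual FormalMATH entry):
-- the LLM prover must replace `sorry` with a machine-checked proof.
theorem toy_statement (n : ℕ) : ∃ k : ℕ, n * (n + 1) = 2 * k := by
  sorry
```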