Günter Klambauer
@gklambauer
Deep Learning researcher known for self-normalizing neural networks and applications of Machine Learning in the Life Sciences; ELLIS Program Director
Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences. xLSTM also shines for DNA, proteins, and small molecules -- it can handle long-range interactions and huge contexts! Paper: arxiv.org/abs/2411.04165

Since 1990, we have worked on artificial curiosity & measuring "interestingness." Our new ICML paper uses a "Prediction of Hidden Units" loss to quantify in-context computational complexity in sequence models. It can tell boring tasks from interesting ones and predict correct reasoning.
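Roughly, the idea as I read it from the abstract, as a minimal sketch: fit a small auxiliary predictor to the model's own hidden-state trajectory and use its prediction error as the complexity score. The choice of a single linear predictor and the squared-error objective here are my assumptions; the exact formulation is in the paper.

    # Sketch (not the paper's code): how hard are the model's hidden states to predict?
    # Higher prediction loss ~ more "new" in-context computation per step.
    import torch, torch.nn as nn

    hidden = torch.randn(1, 128, 512)     # placeholder for [batch, seq_len, d] states from a frozen sequence model
    predictor = nn.Linear(512, 512)       # small auxiliary predictor (assumption: one linear layer)
    opt = torch.optim.Adam(predictor.parameters(), lr=1e-3)

    for _ in range(100):                  # fit the predictor on h_t -> h_{t+1}
        pred = predictor(hidden[:, :-1])
        loss = ((pred - hidden[:, 1:]) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()

    phi_like_score = loss.item()          # used as a per-sequence "interestingness" score in this sketch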
Excited to share our new ICML paper, with co-authors @robert_csordas and @SchmidhuberAI! How can we tell if an LLM is actually "thinking" versus just spitting out memorized or trivial text? Can we detect when a model is doing anything interesting? (Thread below👇)
Despite theoretically handling long contexts, existing recurrent models still fall short: they may fail to generalize past the training length. We show a simple and general fix that enables length generalization on sequences of up to 256k tokens, with no need to change the architectures!
xLSTM for Aspect-based Sentiment Analysis: arxiv.org/abs/2507.01213 Another success story of xLSTM. MEGA: xLSTM with Multihead Exponential Gated Fusion. “Experiments on 3 benchmarks show that MEGA outperforms state-of-the-art baselines with superior accuracy and efficiency”
10 years ago, in May 2015, we published the first working very deep gradient-based feedforward neural networks (FNNs) with hundreds of layers (previous FNNs had a maximum of a few dozen layers). To overcome the vanishing gradient problem, our Highway Networks used the residual…
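For context, the core highway-layer gating in a minimal numpy sketch: a sigmoid transform gate T decides how much of the transformed signal versus the unchanged input passes through, which keeps gradients flowing through very deep stacks. Variable names and initialization here are illustrative, not the paper's code.

    # Minimal highway-layer sketch: y = H(x) * T(x) + x * (1 - T(x))
    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def highway_layer(x, W_H, b_H, W_T, b_T):
        H = np.tanh(x @ W_H + b_H)        # candidate transformation
        T = sigmoid(x @ W_T + b_T)        # transform gate in (0, 1)
        return H * T + x * (1.0 - T)      # gated mix of transform and identity path

    d = 64
    x = np.random.randn(8, d)
    y = highway_layer(x, 0.1 * np.random.randn(d, d), np.zeros(d),
                      0.1 * np.random.randn(d, d), np.full(d, -2.0))  # negative gate bias favors the identity path at first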
xLSTM for multivariate time series anomaly detection: arxiv.org/abs/2506.22837 “In our results, xLSTM showcases state-of-the-art accuracy, outperforming 23 popular anomaly detection baselines.” Again, xLSTM excels in time series analysis.
Great application, but built on the wrong model architecture... We've already shown that the Transformer is inferior to xLSTM on DNA: arxiv.org/abs/2411.04165
Happy to introduce AlphaGenome, @GoogleDeepMind's new AI model for genomics. AlphaGenome offers a comprehensive view of the human non-coding genome by predicting the impact of DNA variations. It will deepen our understanding of disease biology and open new avenues of research.
Really cool new work with amazing students and collaborators.
[1/9]🚀Excited to share our new work, RNE! A plug-and-play framework for everything about diffusion model density and control: density estimation, inference-time control & scaling, energy regularisation. More details👇 Joint work with @jmhernandez233 @YuanqiD, Francisco Vargas
NXAI has successfully demonstrated that their groundbreaking xLSTM (Extended Long Short-Term Memory) architecture achieves exceptional performance on AMD Instinct™ GPUs, a significant advancement in RNN technology for edge computing applications. amd.com/en/blogs/2025/…
🚀 After two+ years of intense research, we’re thrilled to introduce Skala — a scalable deep learning density functional that hits chemical accuracy on atomization energies and matches hybrid-level accuracy on main group chemistry — all at the cost of semi-local DFT. ⚛️🔥🧪🧬
Chemical accuracy with Deep Learning based DFT - #compchem
Parallelizable, with state tracking and learnable information flow. Wowww. Super work by Korbinian and team.
Ever wondered how linear RNNs like #mLSTM (#xLSTM) or #Mamba can be extended to multiple dimensions? Check out "pLSTM: parallelizable Linear Source Transition Mark networks". #pLSTM works on sequences, images, (directed acyclic) graphs. Paper link: arxiv.org/abs/2506.11997
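My hedged reading of the Source/Transition/Mark naming, reduced to a plain 1-D sequence: Source injects the input into the state, Transition propagates the state, and Mark reads it out. This is a sketch of a generic linear recurrence, not the paper's code; pLSTM's contribution is making this parallelizable and extending it to images and DAGs.

    # Linear source-transition-mark recurrence on a sequence (sketch, assumptions as above):
    #   h_t = T_t * h_{t-1} + S_t * x_t   (state update)
    #   y_t = M_t * h_t                   (readout)
    import numpy as np

    def linear_stm_scan(x, S, T, M):
        """x, S, T, M: arrays of shape [seq_len, d] (elementwise/diagonal case for simplicity)."""
        h = np.zeros(x.shape[1])
        ys = []
        for t in range(x.shape[0]):
            h = T[t] * h + S[t] * x[t]
            ys.append(M[t] * h)
        return np.stack(ys)

    L, d = 16, 8
    y = linear_stm_scan(np.random.randn(L, d), np.random.rand(L, d),
                        0.9 * np.random.rand(L, d), np.random.randn(L, d))

Because the recurrence is linear in h, it can be computed with a parallel scan instead of the sequential loop above.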
The European-developed TiRex is leading the field, significantly ahead of U.S. competitors like Amazon, Datadog, Salesforce, and Google, as well as Chinese models from companies such as Alibaba.
GIFT-Eval Time Series Forecasting Leaderboard: evaluates time-series forecasting methods. Now leading: TiRex (arxiv.org/abs/2505.23719). Link: huggingface.co/spaces/Salesfo…

Europe is winning the AI race!! Best foundation model for time-series!
We’re excited to introduce TiRex — a pre-trained time series forecasting model based on an xLSTM architecture.
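If you want to try it, the usage pattern is roughly as below. Treat this as an assumed sketch: the package name, the load_model/forecast calls, and the "NX-AI/TiRex" checkpoint id are my guesses rather than a verified API, so check the official TiRex repository.

    # Assumed usage sketch -- names and signatures are not verified here.
    import torch
    from tirex import load_model          # package and function name assumed

    model = load_model("NX-AI/TiRex")     # pre-trained xLSTM-based forecaster (checkpoint id assumed)
    context = torch.rand(5, 256)          # 5 series, 256 past time steps each
    quantiles, mean = model.forecast(context=context, prediction_length=64)  # zero-shot forecast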
MHNfs: Prompting In-Context Bioactivity Predictions for Low-Data Drug Discovery #DrugDiscovery pubs.acs.org/doi/10.1021/ac… @JSchimunek @sohvi_luukkonen @gklambauer @LITAILab #JCIM Vol65 Issue9 #ApplicationNote
Finally out!!!
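For readers new to the setting, a schematic sketch of few-shot bioactivity prediction: this is not the MHNfs architecture itself (MHNfs builds on modern Hopfield networks); here the query molecule's activity is simply a similarity-weighted vote over a small labeled support set, which is the low-data "prompting" idea in its most basic form.

    # Schematic few-shot bioactivity prediction: softmax-similarity vote over a support set.
    import numpy as np

    def few_shot_activity(query_emb, support_embs, support_labels, beta=4.0):
        """query_emb: [d]; support_embs: [n, d]; support_labels: [n] in {0, 1}."""
        sims = support_embs @ query_emb / (
            np.linalg.norm(support_embs, axis=1) * np.linalg.norm(query_emb) + 1e-8)
        weights = np.exp(beta * sims)
        weights /= weights.sum()              # softmax retrieval weights over the support set
        return float(weights @ support_labels)  # predicted probability of activity

    d, n = 32, 8                              # toy molecule embeddings and labels
    p = few_shot_activity(np.random.randn(d), np.random.randn(n, d),
                          np.random.randint(0, 2, size=n))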
Indeed... he gives some thoughts on that in the book.
Just pre-ordered Hochreiter's book. His LSTM work changed AI - keen to see where he thinks it's headed next.