Manifest AI
@manifest__ai
Had a great time talking Power Attention with the amazing folks at @GoogleDeepMind Montreal. Thanks @pcastr, Adrien, and Zhitao for hosting us!

Why gradient descent minimizes training loss: manifestai.com/articles/gd-mi…
Symmetric power transformers: manifestai.com/articles/symme…
In our latest article, we describe our methodology for research on extending context length. It’s not enough to train an LLM with a large context size. We must train LLMs with a large *compute-optimal* context size. manifestai.com/articles/compu…
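A rough way to see the trade-off (an illustrative sketch, not the article's methodology): under a Kaplan-style FLOP approximation, assumed here, quadratic attention adds a per-token cost that grows with context length, so at a fixed compute budget a longer context buys fewer training tokens.

```python
# Rough FLOP accounting for training a decoder-only Transformer.
# Assumptions (illustrative only, not Manifest AI's methodology):
#   * Kaplan et al.-style estimate: forward + backward ~ 6*N FLOPs per token
#     for the dense layers (N = parameter count, used loosely here),
#   * plus ~ 6 * n_layer * T * d_model FLOPs per token for quadratic attention.

def flops_per_token(n_params, n_layer, d_model, context_len):
    dense = 6 * n_params
    attention = 6 * n_layer * context_len * d_model
    return dense + attention

def tokens_for_budget(flop_budget, n_params, n_layer, d_model, context_len):
    """How many training tokens a fixed FLOP budget buys at a given context length."""
    return flop_budget / flops_per_token(n_params, n_layer, d_model, context_len)

# Example: a GPT-2-small-sized model (~124M params, 12 layers, d_model=768)
# trained under a 1e21 FLOP budget at different context lengths.
budget = 1e21
for T in (1_024, 8_192, 65_536, 524_288):
    n_tok = tokens_for_budget(budget, 124e6, 12, 768, T)
    print(f"context {T:>7}: ~{n_tok:.2e} training tokens")
```

Whether the longer context is worth the training tokens it costs is exactly the compute-optimality question the article is about.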
In the 32nd session of #MultimodalWeekly, we will feature two speakers working on Transformer architecture research and LLMOps for generative AI applications.
Anyone who has trained a Transformer has viscerally felt its O(T^2) cost. It is not tractable to train Transformers end-to-end on long contexts. Here's a writeup of the research direction I believe is most likely to solve this: linear transformers. manifestai.com/blogposts/fast… 1/7
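For readers who haven't seen the trick, here is a minimal non-causal sketch of linear attention in NumPy, assuming the kernelized formulation with a feature map phi (e.g. elu + 1, as in Katharopoulos et al.); this is generic linear attention, not the specific power-attention kernel. Rewriting (phi(Q) phi(K)^T) V as phi(Q) (phi(K)^T V) by associativity drops the cost from O(T^2 d) to O(T d^2).

```python
import numpy as np

def phi(x):
    """Feature map; elu(x) + 1 is one common choice (an assumption here)."""
    return np.where(x > 0, x + 1.0, np.exp(x))

def quadratic_attention(Q, K, V):
    """Kernelized attention computed the naive way: O(T^2) time and memory."""
    A = phi(Q) @ phi(K).T                          # (T, T) similarity matrix
    return (A @ V) / A.sum(axis=1, keepdims=True)

def linear_attention(Q, K, V):
    """Same result via associativity: phi(Q) @ (phi(K).T @ V), O(T d^2)."""
    KV = phi(K).T @ V                              # (d, d_v) summary, no (T, T) matrix
    z = phi(K).sum(axis=0)                         # (d,) normalizer
    return (phi(Q) @ KV) / (phi(Q) @ z)[:, None]

T, d = 4096, 64
rng = np.random.default_rng(0)
Q, K, V = rng.standard_normal((3, T, d))
assert np.allclose(quadratic_attention(Q, K, V), linear_attention(Q, K, V), atol=1e-6)
```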
Sharing our work on how to efficiently implement linear transformers. We trained a GPT-2 model with linear attention and observed a 32x speedup over FlashAttention on a 500k-token context. manifestai.com/blogposts/fast…
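The reason this scales to contexts like 500k tokens is that the causal version of linear attention reduces to a fixed-size running state, so per-token cost does not grow with T. A minimal recurrent sketch (again generic linear attention, not Manifest AI's kernel or their GPU implementation, which would use a chunked, hardware-friendly formulation rather than a Python loop):

```python
import numpy as np

def phi(x):
    """Feature map (elu(x) + 1), same assumption as above."""
    return np.where(x > 0, x + 1.0, np.exp(x))

def causal_linear_attention(Q, K, V):
    """Causal linear attention as a recurrence over a fixed-size state.
    Each step costs O(d * d_v), so the full sequence is O(T * d * d_v);
    no (T, T) attention matrix is ever materialized."""
    T, d = Q.shape
    d_v = V.shape[1]
    S = np.zeros((d, d_v))   # running sum of outer(phi(k_s), v_s) for s <= t
    z = np.zeros(d)          # running sum of phi(k_s) for normalization
    out = np.empty((T, d_v))
    for t in range(T):
        q, k = phi(Q[t]), phi(K[t])
        S += np.outer(k, V[t])
        z += k
        out[t] = (q @ S) / (q @ z)
    return out

# Sanity check against the O(T^2) causal reference on a small example.
T, d = 128, 16
rng = np.random.default_rng(0)
Q, K, V = rng.standard_normal((3, T, d))
A = np.tril(phi(Q) @ phi(K).T)                 # causal mask: keep s <= t only
ref = (A @ V) / A.sum(axis=1, keepdims=True)
assert np.allclose(causal_linear_attention(Q, K, V), ref, atol=1e-6)
```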