Jialuo Li
@JialuoLi1007
🚀 Introducing Science-T2I - Towards bridging the gap between AI imagination and scientific reality in image generation! [CVPR 2025] 📜 Paper: arxiv.org/abs/2504.13129 🌐 Project: jialuo-li.github.io/Science-T2I-Web 💻 Code: github.com/Jialuo-Li/Scie… 🤗 Dataset: huggingface.co/collections/Ji… 🔍…
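For anyone who wants to poke at the benchmark, here is a minimal sketch of loading the data with the Hugging Face `datasets` library. The repo id below is hypothetical; check the collection linked in the tweet for the actual dataset names and splits.

```python
from datasets import load_dataset

# Hypothetical repo id -- the real one is in the linked Hugging Face collection.
ds = load_dataset("Jialuo21/Science-T2I", split="train")
print(ds[0])  # inspect the prompt/image fields the benchmark provides
```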
Over 4 years into our journey bridging Convolutions and Transformers, we introduce Generalized Neighborhood Attention—Multi-dimensional Sparse Attention at the Speed of Light: github.com/SHI-Labs/NATTEN A collaboration with the best minds in AI and HPC. 🐝🟩🟧 @gtcomputing @nvidia
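For intuition about what neighborhood attention computes (independent of NATTEN's fused kernels), here is a naive PyTorch reference in which each query attends only to a k×k window centered on its position. This sketch zero-pads at image borders, which differs from how NATTEN handles edges, and is for illustration only, not a stand-in for the library.

```python
import torch
import torch.nn.functional as F

def naive_neighborhood_attention_2d(q, k, v, kernel_size=7):
    """Naive 2D neighborhood attention: each query attends to a
    kernel_size x kernel_size window around it. q, k, v: (B, H, W, D)."""
    B, H, W, D = q.shape
    pad = kernel_size // 2
    scale = D ** -0.5

    def window(x):
        # Gather the local key/value window around every spatial position.
        x = x.permute(0, 3, 1, 2)                  # (B, D, H, W)
        x = F.unfold(x, kernel_size, padding=pad)  # (B, D*k*k, H*W)
        x = x.view(B, D, kernel_size * kernel_size, H * W)
        return x.permute(0, 3, 2, 1)               # (B, HW, k*k, D)

    k_win, v_win = window(k), window(v)
    q_flat = q.reshape(B, H * W, 1, D)
    attn = (q_flat * scale) @ k_win.transpose(-2, -1)  # (B, HW, 1, k*k)
    attn = attn.softmax(dim=-1)
    out = attn @ v_win                                 # (B, HW, 1, D)
    return out.reshape(B, H, W, D)

x = torch.randn(2, 16, 16, 32)  # toy (B, H, W, D) input
y = naive_neighborhood_attention_2d(x, x, x, kernel_size=5)
print(y.shape)  # torch.Size([2, 16, 16, 32])
```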
This paper is thought-provoking for me. There is a chance that it's easier to "align a T2I model with real physics" in post-training, and let it learn to generate whatever (physically implausible) combinations it likes in pretraining, as opposed to trying hard to come up with…
Embedding a scientific basis in pre-trained T2I models can enhance the realism and consistency of the results. Cool work in "Science-T2I: Addressing Scientific Illusions in Image Synthesis" jialuo-li.github.io/Science-T2I-We…
In Cambrian-1, we found that vision SSL representations usually lagged behind language-supervised ones -- but once the data gap is closed and scaling kicks in, performance catches up. We’ve tried scaling SSL before, but this is the first time I’ve seen real signal: SSL adapts to…
Can visual SSL match CLIP on VQA? Yes! We show with controlled experiments that visual SSL can be competitive even on OCR/Chart VQA, as demonstrated by our new Web-SSL model family (1B-7B params), trained purely on web images – without any language supervision.
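A minimal sketch of extracting features from one of these backbones, assuming the checkpoints ship as standard ViT encoders on the Hugging Face Hub; the model id below is hypothetical, so check the Web-SSL release for the actual names.

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, AutoModel

model_id = "facebook/webssl-dino1b"  # hypothetical id; see the release for real checkpoints
processor = AutoImageProcessor.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

image = Image.new("RGB", (224, 224))  # stand-in for a real web image
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    feats = model(**inputs).last_hidden_state  # patch features; no text tower involved
```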