Trapit Bansal
@TrapitBansal
AI Research | Co-Creator of OpenAI o1 | Previously @OpenAI, @MSFTResearch, @GoogleAI, @facebook, @iiscbangalore, and undergrad @IITKanpur
Thrilled to be joining @Meta! Superintelligence is now in sight 🚀
I’m excited to be the Chief AI Officer of @Meta, working alongside @natfriedman, and thrilled to be accompanied by an incredible group of people joining on the same day. Towards superintelligence 🚀
Introducing a research preview of Codex in ChatGPT openai.com/live
way back on friday, the high score on "humanity's last exam" was o3-mini-high at 13%. now on sunday, deep research gets 26.6%.
whatever else you think of openai it is now responsible for two separate 0->1 general intelligence scaling breakthroughs that required vastly different approach and mindset
🎄🎅starting tomorrow at 10 am pacific, we are doing 12 days of openai. each weekday, we will have a livestream with a launch or demo, some big ones and some stocking stuffers. we’ve got some great stuff to share, hope you enjoy! merry christmas.
Letting the model come up with its own chain of thoughts can unlock System 2 reasoning -- so just let the parrot keep talking.
Some of our researchers behind OpenAI o1 🍓
Thrilled to see the launch of OpenAI o1! It's been an incredible (almost) 3-year journey working on this project with the amazing team at @openai to build AI models that take reasoning and problem-solving to the next level. Excited to see how you all use it! 🚀
We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introduc…
Announcing GPT-4, a large multimodal model, with our best-ever results on capabilities and alignment: openai.com/product/gpt-4
Excited to share our new review paper “Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks ” w/ @Iebertu, yoonjinlee, kylehilburn @cira_csu; Link: arxiv.org/abs/2210.12310 🧵(1/8)
.@akansha_asb (2022) proposes new data-driven techniques for better controlling, modeling, and forecasting residential solar power using physical, #machinelearning, and #deeplearning models on #GOES-R data and derived data products. #LoLManuscriptMonday bit.ly/Bansal_2022
I love how it actually takes 10 seconds to forward 10 seconds on @PrimeVideo
I need to stop creating a squirrel travel series with #dalle2 #dalle art. Impressed and obsessed! 🤩
Very happy to share that I have joined @CIRA_CSU and @ai2enviro as a PostDoc. I will be working on machine learning approaches for satellite data with this amazing group of people!
Turns out language modeling on satellite data is an effective way to develop models for solar nowcasting, i.e. how much power a solar site will produce in the next few minutes. Lots of fun working on this cool inter-disciplinary project, do check it out!
Just released the final chapter of my Ph.D. thesis in our new paper: arxiv.org/abs/2112.13974 We present a general model for solar nowcasting from abundant, readily available multispectral satellite data using self-supervised learning. With @TrapitBansal & David Irwin 🧵1/5
Congratulations Dr. Akansha 👩🔬 🥳✨ So proud of you!!
I am elated to share that I defended my Ph.D. on 18th October'21. Eagerly looking forward to the next phase in my career!
Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP abs: arxiv.org/abs/2111.01322
If you write a paper about X because you read Y, give some love to Y. Don't bury that fact in some bullshit literature review. You might think we're competing but we're not.
Congrats to our faculty, postdocs, students, and collaborators for their 12 papers accepted to EMNLP 2021 @emnlpmeeting: 11 papers to the main conference and 1 paper to the demo track! FYI: All our four core faculty are recruiting PhD students for Fall 2022. Come and join us!
Challenges and Opportunities in NLP Benchmarking Recent NLP models have outpaced the benchmarks to test for them. I provide an overview of challenges and opportunities in this blog post. ruder.io/nlp-benchmarki…