Harsh Bhatt
@harshbhatt7585
19 | building RL Infra to train smaller models to compete with o3 prev ml http://remyx.ai http://secta.ai http://aragon.ai http://voice.ai alumni http://launchx.com http://tks.world
this is the video of last November, was building with arduino
**My vocabulary**: Experience Rollout Trajectories Multi-turn Rewards Momentum Optimizer Loss Add more ...
“By far, the greatest danger of Artificial Intelligence isthat people conclude too early that they understand it.” - Eliezer Yudkowsky
We are in the peak Engineering era; it's like singing. Anyone can write code like anyone can sing a song, but not everyone is Ed Sheeran or Arijit Singh.
.@elonmusk @Tesla It will be cool if Tesla can drive using Reasoning, probably need to have all the CV input that could be input for the reasoning model, and it needs to be finetuned through RL in a simulated environment.
"Once we turn all digital worlds into an environment, solve it with smart RL algorithms, we have digital AGI." - Shunyu Yao
My life is reinforced by the learning.
I do RL all day! :)))))