Stephen Oman
@stephen_oman
AI & ML. Doing Data Science @travelport. Maintaining @NetHack_LE Also here sometimes https://mastodon.ie/@StephenOman What problem are you solving today?
"Programming is not about typing, it's about thinking." — Rich Hickey v/@CodeWisdom
It would be useful to know how many bottles and cans were being recycled before this scheme was implemented. Otherwise “1.6 billion” tells us nothing about the success of the scheme.
The Irish public has collectively returned more than 1.6 billion bottles and cans through the Deposit Return Scheme since it kicked off at the beginning of last year. jrnl.ie/6773768
Forbes 30 under 30 has been doing this for years
It is with more sadness than mere words can convey that we have to report that our beloved Ozzy Osbourne has passed away this morning. He was with his family and surrounded by love. We ask everyone to respect our family privacy at this time. Sharon, Jack, Kelly, Aimee and…
💯 Who knew that the International Math Olympiad (IMO) is much easier than @NetHack_LE for AI.
Meanwhile, another wall - @NetHack_LE - is still standing firm and tall.
Long story short: after a series of misunderstandings, I shelled out more than EUR10,000 to prevent a German audiobook of my work from being released with DRM and now I need your help (assuming you speak German) to get the book into readers' ears! kickstarter.com/projects/docto… 1/
.@Replit goes rogue during a code freeze and shutdown and deletes our entire database
Mine is even simpler. Assuming it's a general purpose learning agent, let it read the NetHack Wiki and ascend once. Probably more of an ASI benchmark at this point.
My bar for AGI is far simpler: an AI cooking a nice dinner at anyone’s house for any cuisine. The Physical Turing Test is very likely harder than the Nobel Prize. Moravec’s paradox will continue to haunt us, looming larger and darker, for the decade to come.
Middle age spread. Happens to the best of us.
How it started vs How it's going 🤩 Which of our F1 cars is your favourite? 🤔
Douglas Adams is looking increasingly prophetic for predicting in Hitchhiker's Guide that computers would get very good at answering questions but would ultimately still need humans to ask the right ones
We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.
Welcome to the Chatsubo, cowboy. #Neuromancer is in production.
P = NP Our entire digital lives depend on it not being true.
As AI agents face increasingly long and complex tasks, decomposing them into subtasks becomes increasingly appealing. But how do we discover such temporal structure? Hierarchical RL provides a natural formalism-yet many questions remain open. Here's our overview of the field🧵
For perspective...
Happy "@NetHack_LE is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).…
Happy "@NetHack_LE is still completely unsolved" day for those of you who are celebrating it. We released The NetHack Learning Environment (arxiv.org/abs/2006.13760) on this day five years ago. Current frontier models achieve only ~1.7% progression (see balrogai.com).…
NetHack of course.
If you were paid per hour to play a video game as a job , but you could only play one game, which one would you choose
Excellent question by @vali_nasr. Here's a thread about Iran's nuclear program, why it's so hard to "destroy", and why "Fordow" is really about dragging the US into war:
Is Fordo really the most important thing about Iran’s nuclear program or is the focus on it designed to drag the US into the war? Since Fordo is the only target that needs US planes and bunker buster bombs.