Mona Jalal @ cvpr2025
@MonaJalal_
Computer Vision Research Engineer III at Toyota Material Handling #TeamToyota #OneToyota #ComputerVision #DeepLearning #cvpr2025 #cvpr
It’s my first time being recognized as an outstanding reviewer and I am so glad it is for ICCV conference. Thanks a lot to the conference organizers and chairs. @ICCV_2021

A sign Google is waking up is them dropping by far and away the biggest free plans across the industry. This time is for the Claude Code competitor. Super excited, next time can be them getting rid of "Gemini Advanced" or whatever it is. Free Gemini.
#CVPR2025 Paper Picks #4 ✨ShowUI (from late '24!) 2B vision-language-action model (smol🤏!) for GUI automation. An agent with visual understanding — no HTML, no accessibility trees, no extra text. Interacts like a human. 🥊 Outperforms much larger VLMs!
Very happy to receive a #SIGGRAPH2025 Best Paper Award (Honorable Mention) for our RSP algorithm, which dices surfaces into near-perfect rectangles. Such meshes are useful for everything from retopology, to microfluidic simulation, to textile design, to architectural geometry.
CVPR Highlight #1 VGGT: Visual Geometry Grounded Transformer (best paper award) 🚀 Super-fast 3D scene reconstruction 🤗 Model and demo on Hugging Face The result is a GLB point cloud 👇
I always like to see a creative oral presentation @kaist_ai @CVPR @soobin “MinorityPrompt:Text to Minority Image Generation via Prompt Optimization” #t2i #prompting #gpt #llm #computervision #ddim #optimization #creativeAI #genAI #ai #deeplearning #CVPR2025 #CVPR25 #cvpr #prompt




Very interesting 3D computer vision research presentation @HKUniversity @KAUST_News @CVPR #cvpr #3dcv #computervision #deeplearning #rgbd #optics #3D #camera




The paper Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World introduces GlobustVP, a novel method for estimating vanishing points (VPs) in structured environments by reformulating the problem using convex optimization. github.com/WU-CVGL/Globus… @CVPR

Anthropic just opened the vault The best free masterclass on prompt engineering is here — and it’s a game-changer
Thanks a lot to Zhuoran Zhao for great walkthrough of her research poster at @CVPR on analyzing the synthetic-to-real domain gap in 3D hand pose estimation. She is looking for an internship if you want to connect with her. alicezrzhao.github.io @NUSingapore @HKUSTGuangzhou

3D reconstruction oral talk craftsman3d.github.io @AdobeResearch @CVPR @LightIllusions @hkust

SapiensID: Foundation for Human Recognition arxiv.org/pdf/2504.04708 #CVPR2025 @drexel_coas @michiganstateu @CVPR #reID #foundationmodels #computervision #deeplearning




Interesting joint work of computer vision and social media data retrieval for image editing- “Image Editing Works Better When Trainer On Our Real-world Editing Data” @CVPR @allen_ai @UW @csenews @Reddit #CVPR2025 #reddit #ai2 #UW @RanjayKrishna #instructpix2pix #deeplearning

Important advances in 6D pose estimation. Check One2Any paper at #CVPR2025 @CVPR @GoogleResearch @INSAITinstitute @ETH_en @TU_Muenchen #cvpr #6dpose #3dcv #objectmanipulation #computervision #computergraphics
One2Any: One-referencing 6D pose estimation for any object
Step 1: - Go to the @GeminiApp website. - Click on ‘Gem Manager’ - Select ‘New Gem Link: gemini.google.com
One2Any: One-referencing 6D pose estimation for any object

It was truly awe inspiring
AI Art Winner – Tom White Congratulations to @dribnet for winning a #CVPR2025 AI Art Award for "Atlas of Perception.” See it and other works in the CVPR AI Art Gallery in Hall A1 and online. thecvf-art.com @elluba
Top minds. Deep ideas. 🎯 Inductive Bias in 3D Generation 🗓️ June 12 — Day 2 of CVPR. Be there!
🔍 3D is not just pixels—we care about geometry, physics, topology, and functions. But how to balance these inductive biases with scalable learning? 👀 Join us at Ind3D workshop @CVPR (June 12, afternoon) for discussions on the future of 3D models! 🌐 ind3dworkshop.github.io/cvpr2025
🔎Can robots search for objects like humans? Humans explore unseen environments intelligently—using prior knowledge to actively seek information and guide search. But can robots do the same? 👀 🚀Introducing WoMAP (World Models for Active Perception): a novel framework for…
Welcome to #WiCV2025, maybe my favorite event, truly building community and guiding new generations. My mentors when I was a PhD, my fellow co-organizers now friends, and many I got the privilege to mentor; all are forever invaluable ❤️. Kudos to this year’s organizers! #CVPR2025