Accepted papers at TMLR
@TmlrPub
Return-Aligned Decision Transformer Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra. Action editor: Romain Laroche. openreview.net/forum?id=lTt2c… #reinforcement #ai #reward
Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier Anirudhan Badrinath, Prabhat Agarwal, Jiajing Xu. Action editor: Frederic Sala. openreview.net/forum?id=R7QFl… #preferential #optimize #preference
[Re] Improving Interpretation Faithfulness for Vision Transformers Izabela Kurek, Wojciech Trejter, Stipe Frkovic, Andro Erdelez. Action editor: Shiyu Chang. openreview.net/forum?id=Z0Dhg… #interpretability #robustness #vision
Enhancing Sample Generation of Diffusion Models using Noise Level Correction Abulikemu Abuduweili, Chenyang Yuan, Changliu Liu, Frank Permenter. Action editor: Shiyu Chang. openreview.net/forum?id=y8VXi… #denoising #deblurring #colorization
Rational Tuning of LLM Cascades via Probabilistic Modeling Michael J. Zellinger, Matt Thomson. Action editor: Aditya Menon. openreview.net/forum?id=YCBVc… #models #bayesian #confidence
Proximal Policy Distillation Giacomo Spigler. Action editor: Dennis Soemers. openreview.net/forum?id=WfVXe… #distillation #distilling #reinforcement
Metamorphic Forward Adaptation Network: Dynamically Adaptive and Modular Multi-layer Learning Yu Sun, Vijja Wichitwechkarn, Ronald Clark, Mirko Kovac, Basaran Bahadir Kocer. Action editor: Mykola Pechenizkiy. openreview.net/forum?id=6RCs2… #forward #adaptation #me
Lie Symmetry Net: Preserving Conservation Laws in Modelling Financial Market Dynamics via Differ... Xuelian Jiang, Tongtian Zhu, Yingxiang Xu, Can Wang, Yeyu Zhang, Fengxiang He. Action editor: Bamdev Mishra. openreview.net/forum?id=rkfop… #symmetries #symmetry #
A Framework for Finding Local Saddle Points in Two-Player Zero-Sum Black-Box Games Shubhankar Agarwal, Hamzah I Khan, Sandeep P. Chinchali, David Fridovich-Keil. Action editor: Kamyar Azizzadenesheli. openreview.net/forum?id=NbRyb… #adversarial #optimization #sadd
Scalable Multi-Output Gaussian Processes with Stochastic Variational Inference Xiaoyu Jiang, Sokratia Georgaka, Magnus Rattray, Mauricio A Álvarez. Action editor: Vincent Fortuin. openreview.net/forum?id=kK0Wr… #mogp #batches #benchmarking
CodeLutra: Boosting LLM Code Generation via Preference-Guided Refinement Leitian Tao, Xiang Chen, Tong Yu, Tung Mai, Ryan A. Rossi, Yixuan Li, Saayan Mitra. Action editor: Rui Zhang. openreview.net/forum?id=IGsEg… #coding #codelutra #supervised
Disappearance of Timestep Embedding: A Case Study on Neural ODE and Diffusion Models Bum Jun Kim, Yoshinobu Kawahara, Sang Woo Kim. Action editor: Pierre Ablin. openreview.net/forum?id=bpaLY… #timesteps #dynamical #timestep
Sparser, Better, Faster, Stronger: Sparsity Detection for Efficient Automatic Differentiation Adrian Hill, Guillaume Dalle. Action editor: Pierre Ablin. openreview.net/forum?id=GtXSN… #sparse #hessian #hessians
Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Vis... Mohammed Baharoon, Jonathan Klein, Dominik Michels. Action editor: Stephen Lin. openreview.net/forum?id=IcOBC… #supervised #discriminative #imagenet
Full-Rank Unsupervised Node Embeddings for Directed Graphs via Message Aggregation Ciwan Ceylan, Kambiz Ghoorchian, Danica Kragic. Action editor: Sinead Williamson. openreview.net/forum?id=3ECbE… #embeddings #embedding #graphs
Prior Learning in Introspective VAEs Ioannis Athanasiadis, Fredrik Lindsten, Michael Felsberg. Action editor: Søren Hauberg. openreview.net/forum?id=u4YDV… #autoencoders #adversarial #introvae
Learning Using a Single Forward Pass Aditya Somasundaram, Pushkal Mishra, Ayon Borthakur. Action editor: Brian Kingsbury. openreview.net/forum?id=EDQ8Q… #backpropagation #cnn #edge
Reproducibility Study of ’SLICE: Stabilized LIME for Consistent Explanations for Image Classifica... Aritra Bandyopadhyay, Chiranjeev Bindra, Roan van Blanken, Arijit Ghosh. Action editor: Fernando Perez-Cruz. openreview.net/forum?id=vKUPX… #superpixels #feature #
Multi-objective Bayesian optimization for Likelihood-Free inference in sequential sampling models... David Chen, Xinwei Li, Eui-Jin Kim, Prateek Bansal, David J Nott. Action editor: Cedric Archambeau. openreview.net/forum?id=hQjwD… #likelihoods #likelihood #sampli
Change Point Detection in the Frequency Domain with Statistical Reliability Akifumi Yamada, Tomohiro Shiraishi, Shuichi Nishino et al.. Action editor: Chuan Sheng Foo. openreview.net/forum?id=FNRda… #monitoring #frequencies #frequency