Johnny Tian-Zheng Wei
@johntzwei
🇺🇸🇨🇳 PhD student at USC. I'm interested in the legal issues of AI.
Are you a researcher, trying to build a small GPU cluster? Did you already build one, and it sucks? I manage USC NLP’s GPU cluster and I’m happy to offer my expertise. I hope I can save you some headaches and make some friends. Please reach out!
Hi all, I'm going to @FAccTConference in Athens this week to present my paper on copyright and LLM memorization. Please reach out if you are interested to chat about law, policy, and LLMs!
Many works addressing copyright for LLMs focus on model outputs and their similarity to copyrighted training data, but few focus on how the model was trained. We analyze LLM memorization w.r.t. their training decisions and theorize on its use in court arxiv.org/abs/2502.16290
I recently wrote a comment suggesting that NIST should standardize best practices on the measurement and mitigation of LLM memorization. You can see it here: johntzwei.github.io/pdfs/strategic…
🚨 NIST is inviting input on draft AI standards — a rare chance to shape the rules of the game. Standards can carry enormous soft power: industry may adopt them as best practices, and judges may reference them when interpreting harms. nist.gov/artificial-int…
Hi all, reminder that our direct submission deadline is April 15th! We are co-located at ACL'25 and you can submit archival or non-archival. You can also submit work published elsewhere (non-archival) Hope to see your submission! sites.google.com/view/memorizat…
🚨 NIST is inviting input on draft AI standards — a rare chance to shape the rules of the game. Standards can carry enormous soft power: industry may adopt them as best practices, and judges may reference them when interpreting harms. nist.gov/artificial-int…
ACL is not an A* conference because people are not submitting A* work


📢The First Workshop on Large Language Model Memorization (L2M2) will be co-located with @aclmeeting in Vienna🎉 💡L2M2 brings together researchers to explore memorization from multiple angles. Whether it's text-only LLMs or Vision-language models, we want to hear from you!🌍
Our new workshop on Large Language Model Memorization will debut at ACL 2025 🎉 See you in Vienna!!
🎉 Happy to announce that the L2M2 workshop has been accepted at @aclmeeting! #NLProc #ACL2025 More details will follow soon. Stay tuned and spread the word! 📣
🎉 Happy to announce that the L2M2 workshop has been accepted at @aclmeeting! #NLProc #ACL2025 More details will follow soon. Stay tuned and spread the word! 📣
I’m a first time attendee at @AIESConf looking forward to meeting you all! I’m presenting a work on how statistical estimation applies to EU’s Digital Services Act. If you’re interested in the legal issues of LLMs or social media, I would be happy to meet you!
📣Hey #NLProc! We* are planning to organize a *ACL workshop on memorization in LLM. Goal: provide a central venue to discuss memorization from different angles (e.g., technical, legal, social, etc). We want to gauge community interest. Would you consider attending? (1/3)
Attending @aclmeeting next week -- currently interested in auditing, memorization, copyright, or any combination of the three. My focus is on applying technical methods to recent legal developments. I'm looking to meet others in my field, please reach out if this interests you!