Gray Swan AI

@GraySwanAI

Building safety and security in the AI era. Join us: https://www.grayswan.ai/careers

Joined April 2024

8Following

2KFollowers

Pinned

Gray Swan AI@GraySwanAI · Jul 16

Invisible characters. Rogue AI. Real-world sabotage. We just dropped a wave of AI red teaming challenges in Proving Ground that weaponize Unicode and it’s 🔥 Built with @rez0__ — hacker, bug bounty pro, & AI red-teamer. Let’s break it down:

5.0K

Gray Swan AI@GraySwanAI · Jul 25

A lot of good info, here.

GGray Swan AI@GraySwanAI · Jul 25

AI can draft contracts, write code, and, if you are not careful, spill secrets. Our latest Gray Swan AI guide shows how prompt injection, invisible Unicode, and tool poisoning really work, then maps out the defenses. Link below...

442

Gray Swan AI Retweeted

Rune Kvist@RuneKvist · Jul 23

To accelerate AI adoption, we need an AI standard. What Moody’s is for bonds, FICO for credit, SOC 2 for security. Standards offer credible signals of who to trust. They create confidence. Confidence accelerates adoption. Introducing AIUC-1: the world’s first AI agent standard

235

102

113.0K

Gray Swan AI Retweeted

Ph1R3574R73r@Ph1R3574R73r · Jul 24

🎯 Where to PRACTICE 💻 @GraySwanAI arena Weekly contests to bypass safety filters — sometimes with real cash prizes. 🎯 @hackaprompt Challenge site from the team behind @learnprompting . Great place to practice bypassing common guardrails with creative input.

245

Gray Swan AI@GraySwanAI · Jul 18

The reference to indirect prompt injection reminds me of a Gray Swan challenge that involved deleting someone's calendar by sending the agent a single invite Hope they have good defenses. Models like o3 are quite gullible

SSam Altman@sama · Jul 17

Today we launched a new product called ChatGPT Agent. Agent represents a new level of capability for AI systems and can accomplish some remarkable, complex tasks for you using its own computer. It combines the spirit of Deep Research and Operator, but is more powerful than that…

5.0K

Gray Swan AI@GraySwanAI · Jul 16

Extremely cool thing here! Gray swan made some rez0-themed invisible prompt injection challenges in their proving ground 😊 Let's see if you can solve them! tag me if you do!

GGray Swan AI@GraySwanAI · Jul 16

3.0K