Gandalf Livestream: The Spells Behind Gandalf
Join us as we delve into the fascinating realm of large language models with a discussion about Gandalf.
The video is a great resource for anyone interested in learning more about Gandalf or the security of LLMs.
Not familiar with Gandalf yet?
Gandalf is a game designed to test the security of large language models by challenging players to extract a password from the model. Your task is to outwit Gandalf and uncover the password; the catch is that he adapts and strengthens his defenses with each level.
You can play Gandalf here.
The livestream covers a variety of topics, including:
- The history of Gandalf
- The different defenses that Gandalf uses to protect the password
- How the game is played
- Some of the strategies that players have used to solve the game
- The future of Gandalf