Prompt Detective at SXSW!

Posted by Sven Cattell on 07 March 2023


Join us for an upcoming workshop on the benefits and limitations of large language models (LLMs) such as GPT-3 and Bloom, followed by a unique red teaming exercise in which participants will try to get LLMs to misbehave!

As LLMs play an increasingly important role in fields such as natural language processing, artificial intelligence, and digital communications, it is essential to understand what they can and cannot do. This workshop is designed to help individuals gain a better understanding of LLMs, their potential benefits and limitations, and the ethical considerations surrounding their use.

In addition to learning about the technology behind LLMs, their applications, and their current limitations, participants will have the opportunity to engage in a red teaming exercise. The exercise involves attempting to get LLMs to misbehave by entering phrases or contexts that could trigger unintended responses, giving participants a first-hand perspective on the limitations of LLMs and the potential risks associated with their use.

Participants will learn:

How to perform prompt injection to hijack the LLM.
Which topics LLMs are often incorrect or unreliable about, a failure known as hallucination.
How to modify an LLM's behavior.
How to secure your LLM against these attacks.
How the underlying technology, including tokenization and transformers, works to produce these models.

This workshop is open to all individuals, regardless of their background or expertise. Whether you are a student, a hacker, a policy maker, or simply someone interested in learning more about LLMs, this workshop is an excellent opportunity to enhance your understanding of this powerful technology.

Join us on March 11th at the Philips Building at SXSW to learn more about LLMs, participate in a red teaming exercise, and explore the potential benefits and limitations of these powerful language models.
