OpenAI, Anthropic Research Reveals More About How LLMs Affect Security and Bias

PulauWin — June 7, 2024 add comment

Anthropic opened a window into the ‘black box’ where ‘features’ steer a large language model’s output. OpenAI dug into the same concept two weeks later with a deep dive into sparse autoencoders.

Categories

OpenAI, Anthropic Research Reveals More About How LLMs Affect Security and Bias

Leave a Reply Cancel reply