Anthropic Developing Constitutional Classifiers to Safeguard AI Models From Jailbreak Attempts

Anthropic Developing Constitutional Classifiers to Safeguard AI Models From Jailbreak Attempts

#news #newstoday #tech #technews #latestnews #techupdates #newsupdates Anthropic announced the development of a new system on Monday that can protect artificial intelligence (AI) models from jailbreaking attempts. Dubbed Constitutional Classifiers, it is a safeguarding technique that can detect when a jailbreaking attempt is made at the input level and prevent…