As new large language models, or LLMs, are rapidly developed and deployed, existing methods for evaluating their safety and discovering potential vulnerabilities quickly become outdated. To identify ...
Erdos, explores what researchers call autoformalization, the process of converting traditional mathematical proofs into formats machines can verify using tools such as Lean and Coq.