Angoras Cat Splendor: The Bizarre Habit No One Can Explain. - Paparazzi Pulse
We introduce CLEVER, the first curated benchmark for evaluating the generation of specifications and formally verified code in Lean. The benchmark comprises of 161 programming problems; it evaluates.