Spade: An Innovative Approach to Synthesize Assertions for Identifying Errors in Large Language Models
A team of researchers from UC Berkeley, HKUST, LangChain, and Columbia University have developed a new system called Spade that automatically generates tests to identify errors in large language models(LLMs) like ChatGPT, Gemini, Claude, and Other…