arXiv cs.CL AI Research Apr 22

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

Significance: 3/5

This study investigates how large language models (LLMs) handle conversational "repair" in multi-turn dialogues about math questions. The researchers found that models differ in how strongly they resist or yield to user-initiated corrections, which makes their behavior unpredictable over longer interactions.

Why it matters: Discrepancies in how models accept user corrections signal fundamental gaps in reasoning reliability and conversational control during complex, multi-turn interactions.
Read the original at arXiv cs.CL

Tags

#llm reliability #multi-turn dialogue #conversational repair #error correction
