Back in July 2024, I was working with Anthropic’s Claude chatbot, and was getting frustrated with its (and ChatGPT’s) regular undermining of nearly-brilliant answers with ridiculously simple mistakes. I applied the phrase “idiot savant” to this phenomenon, though of course others were already describing it that way. I was also (somewhat relatedly) frustrated with Claude’s inability or refusal to provide sources or citations, not only for answers it generated (that’s somewhat understandable given modern AI models don’t store verbatim copies of their training material — a point often overlooked in public coverage of AI copyright litigation), but even when specifically asked to name two or three papers or researchers working in an area for which Claude had just given me a nice overview.
Attached are two transcripts, one from July 2024, the other from December 2024. In July 2024, I first asked Claude a silly test question about the MICR codes at the bottom of checks, saw that it had overlooked a difference in Canadian MICRs that it clearly already knew about, asked it to provide sources, and then got into a lengthy discussion of the “idiot savant” problem and various proposed solutions. When asked, it refused to give me even a single name of a researcher, much less cite a paper, in fields it clearly knew a lot about. By December, it was willing to provide names and references, albeit with a DYOR caveat.
Of course, when querying an AI chatbot with “How did you come up with that answer?” or “What is your basis for that?” or, more generally, “How do you go about generating such answers?,” you can’t assume that its answers reflect any actual “introspection” into its own implementation,* as opposed to generalized (though often very useful) descriptions based on its training data. Perhaps even the “chain of thought” summaries provided by ChatGPT are open to question as to how much they actually reflect the steps the model went through.
[Added Jan. 2, 2025: I should mention here a more recent chat with Claude about instruction fine-tuning, in the context of Claude Sonnet 3.5’s ability to take an uploaded file of (x, y) pairs of numbers and try to determine what function generated them (i.e., Claude’s ability to perform regression on the fly, which seems quite different from answering a prompt based on next-token completion). When I tried this in Claude, in the course of responding to my prompt (which was nothing more than “I’ve uploaded a txt file with some data. Can you tell what’s going on in here?”), it generated ad hoc JavaScript to visualize the trendline shown by the data. It only tried a linear relationship, and reported that the data represents “y = 0.0787x + 3.2711”, whereas a better fit would have been y = 1.355791x^0.440975 (basically a square root with some noise). But still! We then had an amazing chat trying to answer the question (“is this ability more of the same [in a good way], or something new?”), and it explained, with specific examples, how its instruction fine-tuning (related to FLAN, OpenAI’s InstructGPT, and Anthropic’s “Constitutional AI”; Askell paper) has enabled it to use tools like a REPL, and how much this does and does not stem directly from its basic training. I’ll be writing this up separately soon; in the meantime, see the Claude chat on its data-analysis abilities from instruction fine-tuning here. See also an excellent Nathan Lambert talk on the role of reinforcement learning in new “reasoning” models, and recent “reasoning” models like ChatGPT o1 (which is generally what I’ve been using in ChatGPT-related material on this site), the forthcoming o3, or Google Gemini 2.0 Flash Experimental.]
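To make the linear-vs-power-law point concrete, here is a minimal Python sketch. It is not the JavaScript Claude actually generated, and it uses synthetic stand-in data (roughly a square root plus noise, with made-up coefficients) rather than my uploaded file; it just shows the two candidate fits side by side, with the power-law fit done by least squares in log-log space.

```python
# Minimal sketch: compare a straight-line fit against a power-law fit y = a * x^b
# on noisy, roughly-square-root data. Synthetic data stands in for the uploaded file.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for the uploaded (x, y) pairs: ~1.36 * sqrt-ish curve plus noise.
x = np.linspace(1, 100, 200)
y = 1.36 * x**0.44 + rng.normal(0, 0.3, x.size)

# Linear fit: y ≈ m*x + c (the only relationship Claude tried in the chat).
m, c = np.polyfit(x, y, 1)

# Power-law fit via log-log least squares: log y ≈ b*log x + log a.
b, log_a = np.polyfit(np.log(x), np.log(y), 1)
a = np.exp(log_a)

def rss(pred):
    """Residual sum of squares against the observed y values."""
    return float(np.sum((y - pred) ** 2))

print(f"linear:    y = {m:.4f}x + {c:.4f}   RSS = {rss(m * x + c):.2f}")
print(f"power law: y = {a:.4f}x^{b:.4f}   RSS = {rss(a * x**b):.2f}")
```

On data generated this way, the power-law fit comes back close to y ≈ 1.36x^0.44 with a much smaller residual, while the straight line misses the curvature, which is essentially the gap between what Claude reported and the better fit noted above.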
Here’s Claude’s summary of the Dec. 2024 chat:
- Discussed Neuro-Symbolic AI as an approach to the “AI idiot savant” problem, combining neural networks with symbolic reasoning [AS: Note there’s no implication here that this is what new “reasoning” models are doing!]
- Noted evolution in my ability to provide researcher names/citations (with verification caveats), compared to previous reluctance
- Explored Knowledge Grounding, Coherence Modeling, and cited key researchers in these areas
- Clarified that while I can suggest relevant citations, I cannot identify specific sources used in generating my answers
- Discussed Explainable AI (XAI) and key researchers working on making AI systems more interpretable
- Examined concerns about AI systems reproducing training material, including technical aspects (statistical pattern reproduction vs. verbatim storage) and legal/economic implications
- Reviewed papers on extracting training data from language models and legal analysis of AI/copyright issues