Language – Skywritings

16th February 2026

DIC/ISC/CRIA Seminar in Cognitive Informatics

The unreasonable effectiveness of pattern matching

Gary Lupyan (University of Wisconsin-Madison)

February 19, 2026 10:30 – noon EDT

Zoom: https://uqam.zoom.us/j/82427157322

ABSTRACT: It has become common to explain the abilities of large language models (LLMs) as “mere” pattern matching. Because pattern matching is thought to be fragile and highly dependent on having exactly right training data, real thinking and reasoning of the kind that humans do is assumed to be implemented by qualitatively different mechanisms. I am going to argue that our intuitions about the limits of pattern matching are mistaken. At the core of this argument is a demonstration of an astonishing ability of LLMs to make sense of “Jabberwocky” language–texts in which most or all content words have been randomly replaced by nonsense strings–e.g., translating “He dwushed a ghanc zawk” to “He dragged a spare chair”. This ability highlights the unreasonable effectiveness of pattern matching and suggests that pattern matching is not an alternative to real intelligence, but its key ingredient.

Gary Lupyan is Professor of Psychology and Affiliate Professor of Philosophy at the University of Wisconsin–Madison. His research examines how language shapes cognition and perception, how language evolves, and the ways that languages adapt to the needs of their users and learners. His recent work centers in understanding what humans and artificial systems can and cannot learn from language and from the role of natural language input in building human-like intelligence.

Lupyan, G., & Arcas, B. A. y. (2026). The unreasonable effectiveness of pattern matching arXiv:2601.11432.

Lupyan, G., Gentry, H., & Zettersten, M. (2026). How Important Is Language for Human-Like Intelligence? Perspectives on Psychological Science, 17456916251398539 or here.

Wigner, E. (1960). The Unreasonable Effectiveness of Mathematics in the Natural Sciences. Communications on pure and applied mathematics, 12, 1–14.

DATE	SPEAKER	TITLE
Autumn 2025
September 11, 2025 10:30 – noon	Megan Peters, UC Irvine	Confidence, Metacognition, and the “Hard Problem” of Consciousness
September 18, 2025 10:30 – noon	Roger Levy, MIT	Behavioral evaluation of language models as models of human sentence processing
September 25, 2025 10:30 – noon	Chris Potts, Stanford	Meaning in Large Language Models: Bridging Formal Semantics, Pragmatics, and Learned Representations
October 9, 2025 10:30 – noon	Sean Trott, UCSD	Epistemological challenges in the study of “Theory of Mind” in LLMs and humans
October 16, 2025 10:30 – noon	Jean-Baptiste Mouret, INRIA, Nancy	Adaptive Embodied Agents: Implications for Grounding
October 23, 2025 10:30 – noon	Terry Sejnowski, Salk Institute	NeuroAI: The Convergence of Neuroscience and Artificial Intelligence
October 30, 2025 10:30 – noon	Yonatan Bisk, CMU	Embodied language and language‑to‑action: evaluating LLMs in interactive settings
November 6, 2025 10:30 – noon	Cameron Jones, SUNY Stony Brook	Do LLMs pass the Turing test? And what does it mean if they do?
November 13, 2025 10:30 – noon	Rufin VanRullen, CerCo, CNRS, Toulouse	The Global Latent Workspace: A model of cognition with AI applications
November 20, 2025 10:30 – noon	Ari Holtzman, U. Chicago	Articulating the Ineffable: The Analytic Turn in Generative AI
November 27, 2025 10:30 – noon	Chloe Clavel, INRIA	Computational Models of Socio-emotional Interactions in the Era of LLMs – the Challenges of Transparency
December 4, 2025 10:30 – noon	Dupoux Emmanuel EHESS, Paris	Is it really easier to build a child AI than an adult AI?
December 11, 2025 10:30 – noon	Sylvain Calinon, IDIAP, Suisse	Robot learning from demonstration

Winter 2026
January 15, 2026 10:30 – noon	David Strohmaier, U Cambridge	The symbol grounding problem 75 years after Turing’s Test (why computational success still leaves meaning unexplained)
January 22, 2026 10:30 – noon	Jacob Andreas, MIT	Systematic generalization (compositional structure in language models)
January 29, 2026 10:30 – noon (awaiting confirmation)	Thomas Serre, Brown	Cortical feedback mechanisms in visual reasoning: From perceptual grouping to abstract compositional reasoning
February 5, 2026 10:30 – noon	Rajesh Rao, Washington U	Predictive coding and generative models in natural and artificial intelligence

February 19, 2026 10:30 – noon	Gary Lupyan, Wisconsin	The unreasonable effectiveness of pattern matching

March 5, 2026 10:30 – noon	*Jacob Feldman, Rutgers*	Similarities and differences between AI and human learning in a rule-discovery paradigm
March 12, 2026 10:30 – noon	OPEN
March 19, 2026 10:30 – noon	Jean-Rémy King. ENS & Meta AI	Emergence of Language in the Human Brain
March 26, 2026 10:30 – noon	OPEN
April 2, 2026 10:30 – noon	Yair Lakretz, ENS Paris	Linguistic theory and deep language models
April 9, 2026 10:30 – noon	OPEN
April 16, 2026 10:30 – noon	Usef Faghihi, UQTR	From Seeing to Caring: A Ladder for Safe Superintelligence

11th February 202613th February 2026

Semaine 4 : Localisation cérébrale et neurones miroirs : Du « où et quand » à l’architecture fonctionnelle

La critique de Fodor : Localiser n’est pas expliquer. Jerry Fodor pose une question dérangeante : pourquoi cette fascination pour la localisation des fonctions mentales dans le cerveau ? Les techniques d’imagerie cérébrale nous révèlent où et quand certaines régions s’activent durant des tâches cognitives spécifiques. Un secteur s’illumine quand on pense aux théières, un autre quand on entend des noms plutôt que des verbes, un autre encore quand on voit des légumes. Mais quelle connaissance gagnons-nous vraiment ?

L’argument central de Fodor est simple mais dévastateur : savoir où se produit une fonction ne nous dit rien sur comment elle se réalise. Son analogie avec le carburateur est parlante. Comprendre le fonctionnement d’un moteur exige de saisir que le carburateur aère l’essence, qu’il contribue ainsi au fonctionnement global. Mais pourquoi importe-t-il de savoir où exactement le carburateur se situe dans le moteur ? À moins de vouloir le retirer chirurgicalement, cette information topographique n’ajoute rien à la compréhension du mécanisme.

De même pour le cerveau. Personne ne doute sérieusement que parler, faire du vélo ou construire un pont dépend de processus cérébraux quelque part au nord du cou. Mais pourquoi préciser à quel point au nord ? Fodor souligne que les neuroscientifiques accumulent des cartes cérébrales sans hypothèses claires sur ce qu’elles sont censées nous révéler. Ce sont des scientifiques « qui ont une caméra mais pas d’hypothèse ».

La corrélation entre activité neuronale et fonction mentale ne constitue pas une explication causale du fonctionnement. Elle nous dit « quand » et « où », mais reste muette sur le « comment » et le « pourquoi ». Cette critique méthodologique est redoutable : si la recherche la plus coûteuse en neurosciences se contente de localiser sans expliquer, elle absorbe des ressources qui pourraient servir à élucider les mécanismes réels. (Attention : Fodor ne conteste aucunement la valeur de la localisation à la neurologie ou à la neuropsychologie cliniques.)

Les neurones miroirs : Un cas d’école apparent. La découverte de Rizzolatti des neurones miroirs semble, à première vue, illustrer parfaitement la critique de Fodor. Ces neurones dans le cortex précentral ventral du macaque s’activent aussi bien quand le singe exécute une action (saisir un objet) que lorsqu’il observe un autre individu exécuter la même action. Des neurones aux propriétés similaires ont été localisés dans le lobule pariétal inférieur. Chez l’humain, les données d’EEG, MEG, TMS et d’imagerie (TEP, IRMf) confirment l’existence d’un système miroir.

Pendant des années, cette découverte semblait offrir peu plus qu’une cartographie sophistiquée : voici où se produit l’activité miroir, voici quand elle s’active. L’imitation est une capacité comportementale que nous savions déjà posséder, partagée avec de nombreuses espèces, opérant dans plusieurs modalités sensorielles et motrices. Mais identifier les corrélats neuronaux de cette capacité ne révélait pas comment le cerveau la réalise. Aucun message utile pour les roboticiens cherchant à rétro-ingénierier cette réciprocité entre la perception sensorielle et la production motrice. Ce nr sont que des paramètres de localisation spatiotemporelle pour une fonction qui est déjà connue. Cette lecture initiale inscrivait les neurones miroirs dans le paradigme que critique Fodor : beaucoup de technologie coûteuse pour localiser ce que nous savions déjà exister, sans avancer d’un pas vers la compréhension mécanistique.

Le tournant : De la localisation à l’architecture. Mais cette lecture était trop rapide. Ce qui est important n’est pas la localisation d’un type de neurone en un lieu précis, mais la découverte d’un pattern systématique : la fonction miroir se révèle être omniprésent, multimodal, et architecturalement canonique.

Rizzolatti montre que les neurones miroirs ne se contentent pas de coder des actes moteurs isolés. Certains sont « contraints par l’action » : ils ne s’activent que si l’acte moteur observé s’inscrit dans une action spécifique. Un neurone peut décharger pour « saisir-pour-manger » mais pas pour « saisir-pour-placer ». Cette spécificité permet à l’observateur non seulement de reconnaître l’acte moteur observé, mais aussi de coder ce que sera le prochain acte moteur, donc de comprendre l’intention de l’agent.

Au-delà de l’action motrice, le mécanisme miroir opère aussi pour les émotions. L’insula antérieure et le cortex cingulaire antérieur s’activent à la fois quand un sujet éprouve du dégoût directement (exposition à des odeurs répugnantes) et quand il observe l’expression faciale du dégoût chez autrui. Des résultats similaires émergent pour la douleur. Le mécanisme miroir constitue ainsi la base neuronale de l’empathie.

Ce qui importe n’est donc pas qu’il existe un « module du dégoût » localisé dans l’insula, mais que la même structure qui élabore nos propres réponses émotionnelles s’active lors de l’observation des émotions d’autrui. Le pattern révèle un principe architectural : la compréhension d’autrui procède par transcription dans nos propres systèmes moteurs et émotionnels.

Transcription sensori-motrice. Rizzolatti souligne un point crucial : l’observation visuelle seule, sans implication du système moteur, ne fournit qu’une description des aspects visibles du mouvement. Elle n’informe pas sur ce que signifie réellement cette action. Cette information ne peut être obtenue que si la forme de l’action observée est aussi codée dans le système moteur de l’observateur. L’activation du circuit miroir devient ainsi essentielle pour donner à l’observateur une compréhension réelle, expérientielle de la production de l’action perçue.

Voilà ce qui échappe à la critique de Fodor appliquée mécaniquement. Il ne s’agit pas simplement de savoir où se trouve un module d’imitation, mais de reconnaitre que l’apparence perceptive chez autrui est conforme avec sa production chez soi-même. Le mécanisme miroir relie le codage de ce que ça fait de voir l’action par les aires visuelles complexes, vers les aires motrices qui codent ce que ça fait de faire cette action. Ce n’est pas une simple corrélation topographique, c’est un principe fonctionnel : comprendre l’action d’autrui, c’est savoir comment (et pourquoi) la produire soi-même.

Implications pour les origines du langage. Le mécanisme miroir offre un cadre pour aborder les origines gestuelles de la parole (qui seront traitées dans les semaines 6 à 8). Depuis Condillac, plusieurs auteurs ont suggéré que la voie menant à la parole est née des communications gestuelles et non des cris des animaux. Les neurones miroirs créent un lien direct entre l’émetteur du message et le receveur. Grâce au mécanisme miroir, les actions exécutées par un sujet deviennent des messages compris par un observateur sans médiation cognitive.

L’observation d’un sujet saisissant une pomme est immédiatement comprise puisqu’elle évoque le même codage moteur dans le système miroir de l’observateur. Rizzolatti et Arbib ont proposé que le mécanisme miroir soit le système basique à partir duquel a évolué le langage. Le mécanisme miroir aurait résolu deux problèmes fondamentaux de la communication : la parité (ce qui compte pour l’émetteur compte aussi pour le receveur) et la compréhension directe (aucun symbole arbitraire n’est nécessaire).

Évidemment, le mécanisme miroir n’explique pas à lui seul l’extrême complexité de la parole. Mais il aide à résoudre une difficulté fondamentale : comment un message valable pour l’émetteur le devient également pour le receveur. La réciptocit.sensori-moteur fournit une base pré-linguistique pour la communication intentionnelle.

De la simple localisation au pattern architectural. La portée des neurones miroirs dépasse donc largement la simple localisation d’une fonction dans le cortex prémoteur ventral. Ce qui compte, c’est le pattern : la capacité de mirroring est omniprésente (motrice, émotionnelle), multimodale (vision-action, son-action, émotion-expression), et distribuée à travers plusieurs régions interconnectées (cortex prémoteur, lobule pariétal, insula, cortex cingulaire).

Ce pattern révèle quelque chose d’architectural sur l’organisation fonctionnelle du cerveau : la compréhension d’autrui, qu’elle soit motrice, émotionnelle ou communicative, procède par transcription dans les systèmes propres de l’observateur. Cette transcription n’est pas une simple « copie » passive, mais aussi une activation ressentie qui permet la compréhension.

Fodor a raison : savoir où se trouvent les neurones miroirs ne constitue pas en soi une explication. Mais le pattern systématique de leur distribution multimodale et de leur fonctionnement révèle un principe architectural : la perception et la production sont intimement couplées, la compréhension passe par la réciprocité sensori-motrice.

L’imitation et l’apprentissage. Rizzolatti distingue deux aspects de l’imitation : la capacité de reproduire immédiatement une action observée, et celle d’apprendre une nouvelle action par observation. La répétition immédiate est assurée par le système miroir seul. Mais l’apprentissage par imitation exige l’intervention du lobe préfrontal, qui combine des actes moteurs élémentaires codés par le système miroir pour produire des configurations motrices nouvelles.

Cette distinction est cruciale. Le système miroir fournit les « copies motrices » d’actions observées, mais la composition de nouvelles séquences motrices requiert des mécanismes combinatoires préfrontaux supplémentaires. Encore une fois, ce n’est pas la localisation qui importe, mais la compréhension que différents niveaux de capacités imitatives recrutent différentes architectures neuronales en interaction.

Implications cliniques et évolutives.

Les données montrent que les enfants autistes ont un déficit dans leur système miroir, et que la sévérité de leur affection corrèle avec l’importance de ce déficit. Ceci suggère que le système miroir joue un rôle dans les capacités sociales fondamentales, notamment la compréhension d’autrui et la communication.

D’un point de vue évolutif, on peut voir le mécanisme miroir comme une solution au problème de la communication interindividuelle et de la cognition sociale. L’évolution darwinienne – le « Blind Watchmaker » – a résolu le problème de l’implémentation de cette capacité. Mais identifier ce que l’évolution a construit ne dispense pas les sciences cognitives de leur tâche : la rétro-ingénierie de cette capacité. Comprendre comment le cerveau réalise le réciprocité perception-production reste un défi ouvert.

Conclusion : Au-delà de Fodor. La critique de Fodor garde toute sa force contre une neuroscience qui se contenterait de cartographier sans théoriser. Savoir où et quand certaines régions s’activent ne suffit pas. Mais lorsque les données de localisation révèlent un pattern systématique – la distribution multimodale et fonctionnellement cohérente du mécanisme miroir – elles contraignent nos théories sur l’architecture fonctionnelle.

Les neurones miroirs ne sont pas intéressants parce qu’ils sont dans le cortex prémoteur plutôt qu’ailleurs. Ils sont intéressants parce que leur pattern de distribution révèle un principe : la compréhension sensori-motrice, émotionnelle et communicative d’autrui procède par transcription dans nos propres systèmes. Cette transcription constitue le substrat expérientiel de la compréhension.

La question n’est plus « où sont les neurones miroirs ? » mais « comment le cerveau implémente-t-il cette capacité de mirroring multimodale ? » Voilà une question que même Fodor pourrait peut-être juger digne d’investigation. Elle ne demande pas simplement une caméra, mais une hypothèse sur le mécanisme qui permet à la perception de se transcrire en représentation motrice, réciproquement et d’ainsi fonder la compréhension.

L’évolution a résolu le problème de l’implémentation. Aux sciences cognitives de faire leur travail de rétro-ingénierie.

Post-scriptum : La critique de Fodor à propos de la neuro-imagerie et de la localisation ne concerne pas seulement le phénomène des neurones miroirs lui-même, mais aussi le fait que la neuro-imagerie permet de poursuivre l’étude des capacités miroirs chez l’humain de manière inoffensive, plutôt que par des expériences invasives et cruelles sur les singes. La neuro-imagerie rend ce remplacement possible aussi dans beaucoup d’autres domaines de recherche neuroscientifique qui causent la souffrance aux animaux non humains.

31st January 20264th February 2026

L’argument de La Chambre Chinoise de Searle

L’article de John Searle de 1980, Minds, Brains, and Programs, n’était pas une attaque générale contre l’intelligence artificielle, ni un plaidoyer mystique en faveur de l’ineffable humain. Sa cible était une thèse bien précise, qu’il a maladroitement baptisée « Strong AI », mais qui correspond à ce que l’on appelle plus clairement aujourd’hui le computationnalisme (« C = C »): l’hypothèse que la cognition n’est rien de plus que de la computation, autrement dit que les états mentaux sont des états computationnels, indépendants du support matériel qui les implémente.

Dans le cadre du cours, il est crucial de formuler correctement la cible de Searle. Il ne s’agit pas de savoir si les ordinateurs sont utiles pour modéliser la cognition (ce que Searle accepte), ni si des machines peuvent faire des choses impressionnantes. La question est celle-ci : si un système purement computationnel réussissait le test de Turing verbal (T2) à l’échelle d’une vie entière, serait-il pour autant en train de comprendre ce qu’il dit ? Le test est radical : pas un jeu de cinq minutes, pas une démonstration de surface, mais une indiscernabilité verbale durable avec des interlocuteurs humains normaux, sur n’importe quel sujet.

L’expérience de pensée de la Chambre chinoise suppose précisément ce cas. Imaginons qu’un programme permette à un ordinateur de passer avec succès un tel test en chinois. Searle, qui ne comprend pas le chinois, est placé dans une pièce et reçoit des chaînes de symboles chinois. À l’aide de règles formelles (un algorithme) exprimées en anglais, il manipule ces symboles et renvoie d’autres symboles chinois. De l’extérieur, à ces interlocuteurs chinois, le mots de Searle sont indistinguable de ceux d’un locuteur chinois natif : questions, réponses, discussions prolongées sur n’importe quel sujet imaginable. Pourtant, du point de vue interne, Searle n’a aucune compréhension du chinois. Il ne sait pas ce que signifient les symboles qu’il manipule.

L’argument repose alors sur un principe central du computationnalisme : l’indépendance du logiciel (l’algorithme) de son implémentation matérielle (la machine de Turing qui manipule les symboles (les mots chinois). Si comprendre une langue était une propriété purement computationnelle, alors tout système implémentant le bon programme devrait comprendre, indépendamment du matériel qui exécute le logiciel (un Mac, un PC) . Or ici c’est Searle qui exécute ce logiciel qui réussi le T2 chinois. Et pourtant, Searle ne comprend les symboles chinois qu’il est en train de manipuler. C’est ici qu’intervient ce qu’on a appelé plus tard « le périscope de Searle » : une occasion exceptionnelle de pénétrer la barrière des autres esprits. Normalement, une personne ne peut aucunement savoir si une autre personne ressent quoi que ce soit: une idée, une humeur, une sensation. Mais si la cognition était identique à une computation, alors en devenant lui-même l’implémentation matérielle du logiciel qui réussit le T2 chinois, Searle devrait lui-même ressentir la compréhension du chinois en exécutant toutes les manipulations de symboles qui font la compréhension du chinois (C=C). Mais il peut nous faire le témoignage: « Je manipule les symboles qu’on me donne comme entrées, selon les règles de manipulation qu’on me donne aussi, mais je n’en comprends absolument rien. Je ne comprends toujours pas le chinois. Donc la conclusion est inévitable : la cognition n’est pas de la computation (C≠C). Plus précisément, elle ne peut pas être exclusivement computationnelle.

C’est là que commencent les malentendus, souvent entretenus par Searle lui-même. La réponse la plus célèbre est la « System Reply » selon laquelle Searle ne serait qu’une partie du système; pourtant c’est le système global — Searle + les règles, les symboles, la pièce — qui comprendrait le chinois. Searle réplique facilement en internalisant le système intégral : il n’aurait qu’àmémoriser les règles et effectuer toutes les manipulations dans sa tête. Rien ne change : il n’y a toujours aucune compréhension. Cette réplique est décisive contre l’idée que la simple agrégation de composants syntaxiques (la manipulation des symboles de forme arbitraire d’après les règles) puisse engendrer une compréhension du sens.

Mais beaucoup de critiques ont refusé la conclusion en raison de formulations confuses. D’abord, l’usage par Searle des termes « Strong AI » et « Weak AI » a brouillé le débat. « Weak AI » ne désigne en réalité que la la thèse forte de Church-Turing : que la computation peut simuler pratiquement n’importe quel objet ou processus dans l’univers. Cette thèse est compatible avec l’argument de Searle. L’argument ne montre pas que la cognition ne peut pas être simulée, mais qu’une simulation computationnelle n’est pas, en elle-même, l’objet simulé: la bonne recette pour faire un gâteau végane n’est pas elle même le gâteau végane; et l’exécution de la recette végane n’est pas juste de la computation: c’est de l’impression 3D: On mélange les ingrédients, puis on fait cuire le gâteau au four. Ce qui n’est plus juste de la computation: Cuisiner n’est pas juste de la computation!

Deuxième confusion : l’idée que Searle aurait réfuté le test de Turing en tant que tel. C’est faux. L’argument montre seulement que T2, pris isolément et sous une interprétation strictement computationnelle, ne garantit pas la compréhension. Il ne dit rien contre T3 (ancrage sensorimoteur) ni contre T4 (duplication structurelle complète). En fait, l’argument laisse entièrement ouverte la possibilité qu’un système hybride — computationnel et non computationnel — puisse comprendre, ou qu’un système robotique ancré dans le monde puisse acquérir des significations que Searle, enfermé dans sa pièce, ne peut pas acquérir.

Troisième erreur fréquente : croire que Searle aurait montré que « la cognition n’est pas computationnelle du tout ». L’argument ne montre rien de tel. Il montre seulement que la cognition ne peut pas être uniquement computationnelle. La computation peut parfaitement jouer un rôle causal essentiel dans un système cognitif, sans en épuiser les propriétés sémantiques. Sur ce point, la « System Reply » avait une intuition juste, même si elle échouait comme réfutation : comprendre peut être une propriété d’un système global, mais pas d’un système purement syntaxique.

Enfin, Searle a lui-même surinterprété sa conclusion en suggérant que la solution devait nécessairement passer par la duplication des pouvoirs causaux du cerveau biologique. Rien dans l’argument n’impose un tel saut vers T4. Il reste une vaste gamme de possibilités intermédiaires : systèmes dynamiques non computationnels, architectures hybrides, réseaux neuronaux couplés au monde, agents sensorimoteurs apprenant par interaction. L’argument ne tranche pas en faveur des neurosciences contre la science cognitive. Il tranche uniquement contre le computationnalisme pur.

Malgré ces excès et ces confusions, l’importance historique de la Chambre chinoise est considérable. Elle a forcé la discipline à distinguer clairement syntaxe et sémantique, simulation et instanciation, performance verbale et compréhension. Elle a aussi préparé le terrain pour ce qui deviendra explicitement, quelques années plus tard, le problème de l’ancrage des symboles : comment des symboles formels peuvent-ils acquérir une signification intrinsèque pour un système, plutôt que seulement une interprétation extrinsèque par un observateur ?

La leçon méthodologique centrale est donc la suivante : passer un test comportemental, même très exigeant, n’explique pas en soi comment le sens est généré. L’argument de Searle ne ferme pas la route vers une science mécaniste de la cognition. Il ferme seulement une impasse : celle qui croyait pouvoir expliquer l’esprit par la manipulation de symboles non ancrés. En ce sens, loin d’être un obstacle, la Chambre chinoise a été un déclencheur. Elle a rendu inévitable la question qui structure la suite du cours : comment relier les symboles au monde, et le langage à l’action et à la perception.

24th January 2026

Le test de Turing et la rétro-ingénierie de la capacité cognitive

L’article de Turing de 1950 ne proposait ni un tour de passe-passe, ni un concours de tromperie, ni un jeu d’imitation à court terme. Il proposait un tournant méthodologique pour une future science de l’esprit : cesser de demander ce qu’est la pensée et chercher plutôt à expliquer comment les penseurs peuvent faire ce qu’ils peuvent faire. Le remplacement de la question vague « Les machines peuvent-elles penser ? » par un critère opérationnel ne visait pas à banaliser la cognition, mais à l’ancrer dans la capacité de performance empirique. L’enjeu n’était pas de duper des juges, mais de rétro-ingénier la totalité des capacités cognitives humaines de manière à les rendre reproductibles par une explication causale dont nous comprenons le fonctionnement. La question centrale n’est pas de savoir si une machine peut passer pour un penseur, mais comment et pourquoi les humains pensants peuvent faire tout ce qu’ils peuvent faire.

La terminologie malheureuse de « jeu » et d’« imitation » a entretenu une confusion durable. L’intuition méthodologique de Turing est que la cognition est invisible, alors que la performance ne l’est pas. Nous ne pouvons pas observer la pensée directement, ni chez autrui ni chez les machines, mais nous pouvons observer ce que les penseurs ont la capacité de faire. Le test n’a donc jamais porté sur la supercherie, mais sur l’indiscernabilité en capacité de faire (dites « l’indiscernabilité Turingienne »). L’interrogateur n’est pas un naïf, mais n’importe quel penseur humain neurotypique. Le véritable critère n’est pas que des personnes soient trompées, mais qu’il n’existe aucun moyen de distinguer le candidat d’un être humain normal à partir de sa performance observable. S’il y a une différence discernable, le candidat échoue. Sinon, alors la rétroingénieurie a réussi et le mécanisme interne qui a produit le succès constitue une potentielle explication causale de la capacité cognitive humaine.

Cela soulève immédiatement la question de l’étendue et de la durée du Test de Turing. La remarque occasionnelle de Turing sur cinq minutes et des pourcentages a été interprétée de façon absurde. Il s’agit d’une prédiction démographique, non d’un critère scientifique. La science cognitive n’est pas l’art de tromper certaines personnes pendant un certain temps. Un test sérieux de la cognition rétro-ingéniérée doit être ouvert et, en principe, valable sur toute une vie. Le candidat doit pouvoir continuer à faire ce que les humains peuvent faire, à travers les domaines et les contextes, sans s’épuiser dans des astuces pré-programmées ou des bases finies de cas. Un système qui s’effondre lorsque la conversation s’aventure en terrain imprévu, ou lorsqu’il est soumis à des sondages persistants, ne révèle pas une capacité cognitive générale. Il révèle un artefact de performance borné.

Tout aussi importante est la restriction au canal verbal. Turing a introduit l’interaction dactylographiée comme un moyen de mettre entre parenthèses l’apparence et les indices physiques non pertinents, non comme une thèse selon laquelle la cognition serait épuisée par le langage. L’exclusion de la voix, du geste et de l’incarnation visait à neutraliser des indices superficiels, non à nier que les humains sont des agents sensorimoteurs dans un monde physique. Interpréter le test comme intrinsèquement verbal revient à confondre une commodité méthodologique avec un engagement théorique. La capacité cognitive humaine n’est pas un simple module de clavardage. Elle est ancrée dans la perception, l’action et l’interaction causale avec le monde. Un système qui ne peut qu’échanger des symboles, sans pouvoir voir, se déplacer, manipuler et être affecté par son environnement, manque une grande partie de ce que les humains peuvent faire.

C’est pourquoi la distinction entre une indiscernabilité purement verbale et une indiscernabilité robotique complète est cruciale. Un système qui réussirait une vie entière d’échanges par courriel serait déjà une prouesse d’ingénierie remarquable, mais il laisserait ouverte la question de savoir si ce même système pourrait, par exemple, sortir, regarder le ciel et dire si la lune est visible, apprendre à utiliser des outils inconnus, se déplacer dans un environnement encombré, ou acquérir de nouvelles catégories ancrées dans les choses qu’il y a dans le monde auxquelles réfèrent leurs noms, par essais et erreurs. Ce ne sont pas des options accessoires. Elles font partie du répertoire ordinaire de la performance cognitive humaine. Traiter le langage comme un module autonome, c’est risquer de confondre une interface puissante avec un esprit complet.

Cela conduit à la question de la computation. Le travail de Turing sur la calculabilité, et la thèse de Church-Turing, portent sur ce qui peut être calculé par manipulation de symboles selon des règles. Ils n’affirment pas que tous les processus causaux sont computationnels, ni que la cognition n’est rien d’autre que de la computation. Le test lui-même est agnostique quant aux mécanismes internes. Il n’exige pas que le candidat réussi soit un ordinateur numérique. Ce qu’il exige, c’est que nous l’ayons construit et que nous comprenions, au moins en principe, comment il fonctionne. L’objectif est l’explication, non la simple duplication. Cloner un être humain, même si cela produisait un performer indiscernable, ne constituerait pas une explication de la cognition, car nous n’aurions rien rétro-ingéniéré. Nous aurions simplement reproduit ce que nous cherchions à expliquer.

Turing semble parfois glisser vers une restriction aux ordinateurs numériques, en partie en raison de l’universalité de la computation. Mais l’universalité de la simulation n’est pas l’universalité de l’instanciation physique. Un avion simulé ne vole pas, et un robot simulé n’agit pas dans le monde. L’équivalence formelle ne confère pas de capacité causale dans le monde réel. Un agent sensorimoteur virtuel dans un environnement virtuel peut être utile pour la modélisation et les tests, mais il ne satisfait pas en lui-même à un critère de performance dans le monde réel. Si la cognition dépend en partie d’un couplage sensorimoteur réel avec l’environnement, alors un système purement computationnel, aussi sophistiqué soit-il, peut ne pas satisfaire au critère complet de performance.

Il ne s’agit pas d’une thèse métaphysique sur l’incarnation pour elle-même. Il s’agit d’une thèse empirique sur ce que les humains peuvent faire. La compétence verbale humaine est plausiblement ancrée dans l’expérience non verbale (sensori-motrice, robotique. Une grande partie de ce que nous pouvons dire présuppose ce que nous pouvons voir, toucher, reconnaitre, identifier, dénommer, décrire, manipuler et apprendre par interaction. Un système qui n’a jamais rencontré le monde autrement que par le texte est contraint de s’appuyer sur des descriptions verbales indirectes produites par d’autres. Ce n’est pas équivalent à un ancrage sensorimoteur propre. La différence est décisive si l’objectif n’est pas de mimer un comportement de surface dans des contextes restreints, mais de correspondre à la capacité humaine générique.

Le succès contemporain des grands modèles de langage rend ce point particulièrement saillant. Ces systèmes présentent une fluidité verbale et une étendue de connaissances apparentes extraordinaires. Ils peuvent soutenir de longs échanges, s’adapter à de nombreux sujets et paraître souvent étonnamment humains dans des interactions textuelles. Mais ils y parviennent en s’entraînant sur des corpus massifs de langage produit par des humains. Ils héritent, en effet, d’un immense réservoir de descriptions verbales du monde de seconde main . Ce n’est pas un ancrage au sens pertinent pour la rétro-ingénierie de la cognition. C’est une structure empruntée. Le système n’a pas appris ses catégories en agissant dans le monde et en recevant un retour correctif. Il a appris des régularités statistiques dans le texte qui reflètent la manière dont des humains ancrés parlent du monde.

C’est ce qui rend le phénomène de la « grosse gorgée » à la fois fascinant et méthodologiquement trompeur. Il peut produire des performances verbales impressionnantes sans que le système lui-même ait l’histoire causale qui, chez les humains, sous-tend normalement cette performance. Il devient alors plus difficile de déterminer, à partir du seul comportement verbal, si le système possède une capacité générale ou s’il exploite un proxy massif mais en fin de compte fini de l’expérience. Une sonde de type Turing purement verbale devient donc de plus en plus vulnérable à des facteurs de confusion. Le système peut réussir de nombreux tests conversationnels non pas parce qu’il peut faire ce que les humains peuvent faire, mais parce qu’il a absorbé un enregistrement massif de ce que les humains ont dit sur ce qu’ils peuvent faire.

Cela ne montre pas que Turing avait tort avec son test. Cela montre que le canal verbal n’est plus un test de résistance suffisant. Si le test doit conserver son rôle comme critère de cognition rétro-ingéniérée, il doit être compris dans son sens complet, non abrégé. Le véritable étalon n’est pas une interface de clavardage, mais un système qui peut vivre dans le monde comme nous, acquérir de nouvelles catégories, apprendre des conséquences, corriger ses erreurs et intégrer perception, action et langage dans une seule capacité de performance cohérente.

La discussion par Turing des objections reste ici instructive. L’objection de Lady Lovelace, selon laquelle les machines ne peuvent faire que ce que nous leur disons de faire, repose sur une conception erronée des règles et de la nouveauté. Des systèmes gouvernés par des règles peuvent néanmoins produire des résultats imprévisibles en pratique, et le comportement humain n’est pas moins régi causalement par des régularités. La question profonde n’est pas de savoir si les machines peuvent nous surprendre, mais si nous pouvons expliquer comment un système en vient à posséder les capacités flexibles et ouvertes qui caractérisent les humains. La surprise est bon marché ; la compétence générique ne l’est pas.

De même, les arguments fondés sur Gödel concernant l’intuition mathématique manquent la cible s’ils sont interprétés comme montrant que la pensée humaine transcende mécaniquement toute explication causale. Savoir qu’une proposition est vraie n’est pas la même chose qu’avoir une preuve formelle, et aucun de ces faits n’établit, à lui seul, que la cognition ne puisse être mécanisée au sens de la performance pertinent pour le test. Le test de Turing ne tranche pas les questions métaphysiques sur l’esprit ou la conscience. Il fournit un critère d’adéquation explicative en science cognitive.

Cela conduit à la distinction cruciale entre faire et ressentir. Même un système qui satisferait pleinement au critère de performance ne serait pas, pour autant, connu comme ressentant. C’est le « problème des autres esprits », qui s’applique aussi bien aux humains qu’aux machines. Le test n’est pas une solution au problème de la conscience. C’est une solution au problème méthodologique de l’évaluation de l’explication de la capacité cognitive: le succès de la rétro-ingénierie. Un candidat réussi nous donnerait, au mieux, une explication de la manière dont le faire est généré. La question de savoir s’il y a du ressenti, et comment le ressenti surgit, resterait un problème distinct, et peut-être insoluble.

Dans cette perspective, les affirmations selon lesquelles les LLM actuels auraient « réussi le test de Turing » confondent une indiscernabilité locale, à court terme et textuelle, avec une capacité cognitive générique, incarnée et valable sur toute une vie. Elles confondent également la tromperie démographique avec l’explication scientifique. Un système qui peut induire en erreur une fraction de juges pendant quelques minutes n’a pas, pour autant, été montré comme possédant une cognition de niveau humain. Il a montré que nos intuitions verbales (et nos capacités neurones-mirroir) sont faillibles et que la fluidité de surface est plus facile à obtenir qu’une compétence profonde et ancrée.

La contribution durable de Turing n’a pas été de nous donner un jeu de société, mais de fixer un programme empirique de recherche. La science cognitive, dans cette optique, consiste à rétro-ingénier la capacité de faire ce que les penseurs peuvent faire. Le test est le critère d’aboutissement de cette entreprise, non un raccourci pour la contourner. Si l’on prend cela au sérieux, le véritable défi n’est pas de construire de meilleurs bavards, mais de construire des systèmes capables d’agir, d’apprendre et de vivre dans le monde d’une manière indiscernable, en principe et en pratique, de ce que les humains peuvent faire au cours d’une vie. Ce n’est qu’alors qu’il serait raisonnable de dire que le projet de rétro-ingénierie a réellement abouti.

Turing, A. M. (1950/1990). Machines informatiques et intelligence. Mind, 49, 433-460.

Harnad, S. (2008) The Annotation Game: On Turing (1950) on Computing,Machinery and Intelligence. In: Epstein, Robert & Peters, Grace (Eds.) Parsing the Turing Test: Philosophical and Methodological Issues in the Quest for the Thinking Computer. Springer

29th December 202510th January 2026

The Senses of Sentience

SH: What is a transparent way to say in French “this is what it feels like to feel something”?

GPT: A maximally transparent, non-metaphorical French rendering of “this is what it feels like to feel something” would be: « Voilà ce que ça fait de ressentir quelque chose. » This is idiomatic, minimal, and avoids philosophical embellishment.

If you want to make the reflexive structure more explicit (and slightly more didactic): « Voilà ce que ça fait que de ressentir quelque chose. » (grammatically fine, a bit heavier)

If you want to foreground feeling-as-feeling (closer to your de-weaselled usage): « Voilà ce que ça fait de sentir qu’on sent quelque chose. » (more explicit, but risks sounding artificial)

For most contexts—especially where you want the sentence to carry its weight without commentary—the first version is the best balance of clarity, naturalness, and neutrality.

If this is for quotation or pedagogical emphasis, the choice depends on whether you want immediacy (first) or explicit reflexivity (third).

SH: “ce que ça fait de” rather than “à quoi ça ressemble de” ?

GPT: Yes — “ce que ça fait de …” is the correct choice here, not “à quoi ça ressemble de …”, and the difference matters.

« ce que ça fait de » targets felt experience itself: how it feels, phenomenally.
« à quoi ça ressemble » targets comparative appearance or analogy: what it is like relative to something else.

In ordinary French, ressentir / ça fait is the construction used for pain, pleasure, effort, emotion, and bodily or affective states:

Voilà ce que ça fait d’avoir mal.
Voilà ce que ça fait d’être amoureux.

By contrast, « à quoi ça ressemble » invites description by resemblance:

À quoi ça ressemble, un trou noir ?
À quoi ça ressemble, la vie sur Mars ?

Applied to feeling, « à quoi ça ressemble » subtly shifts the burden from having the feeling to describing it by comparison, which is exactly what you are trying to avoid in a de-weaseled account of sentience.

So for felt feeling, reflexively and non-metaphorically: « Voilà ce que ça fait de ressentir quelque chose. » That is the idiomatic, phenomenologically direct French equivalent.

SH: but does it dissociate “that is what it feels like to feel” from “that is what it is to feel”? (Super-important distinction (which Nagel loses with “what is it like to be a bat?” rather than “what does it feel like to be a bat?”.

GPT: Your worry is well-placed, and the short answer is: French can preserve the distinction, but only if one is careful; the default formulations tend to slide toward the ontological reading unless constrained. Let me be precise.

1. The danger point

In French, as in English, there is a latent ambiguity between:

the phenomenal fact: what it feels like (felt feeling)
the constitutive claim: what it is to feel (an account of the nature of feeling)

Nagel’s “what is it like to be a bat?” collapses the two by using “is,” inviting metaphysical inflation. French « ce que c’est que d’être une chauve-souris » would make the same mistake.

2. Why « ce que ça fait de » is safer

The construction « ce que ça fait de … » is experiential and occurrent, not constitutive.

Voilà ce que ça fait d’avoir mal.
Voilà ce que ça fait de courir un marathon.

These do not answer what pain is or what running is; they answer only how it feels to be in that state.

So: « Voilà ce que ça fait de ressentir quelque chose. » is correctly read as: This is the felt character of feeling something. —not as: This is what feeling consists in.

3. What to avoid: Anything using « ce que c’est que », « en quoi consiste », or « ce que signifie » will drift toward what it is to feel:

❌ « Voilà ce que c’est que de ressentir quelque chose. »
(ontological / constitutive)
❌ « En quoi consiste le fait de ressentir… »

4. If you want maximal insulation

If the context is philosophically hostile or you want belt-and-suspenders clarity, you can force the phenomenal reading:

« Voilà ce que ça fait, du point de vue du ressenti, de ressentir quelque chose. »
« Voilà ce que ça fait, tel que ça se ressent, de ressentir quelque chose. »

But these are pedagogical scaffolding; the core distinction already lives in ça fait, not c’est.

Bottom line:

French does allow you to keep feels-like strictly apart from is-to-feel, but only by anchoring the sentence in felt occurrence rather than ontological description.

« Ce que ça fait de… » does that; Nagel’s formulation does not.

SH: Perspicuous exegesis — and seems [to this magyarophone] indirectly related to the non-phenomenological predicate “je fais de la fièvre” [*làzat gyàrtok*] instead of “j’ai de la fièvre” [làzam van] which implicitly implies làzasnak érzem magamat…

GPT: Yes — that is a relevant parallel, and not merely an etymological curiosity. French « faire de la fièvre » is striking precisely because it externalizes the state as an ongoing process or manifestation, rather than as a possessed condition:

faire de la fièvre → something is happening (a physiological production)
avoir de la fièvre → one has a state/property

Hungarian makes the contrast even sharper, as you note:

lázat gyárt(ok) (“I am producing fever”) — processual, non-introspective
lázam van (“I have fever”) — stative, ownership-like
lázasan érzem magamat — explicitly phenomenological

The key point, aligned with your concern about Nagel, is this:

“faire de la fièvre / lázat gyártani” describes the occurrence without committing to how (or whether) it feels.

Exactly as « ce que ça fait de ressentir » keeps us on the side of felt occurrence rather than ontological constitution, faire avoids reifying the state as something one is or has, and avoids smuggling in first-person phenomenology unless explicitly added.

So the parallel is not incidental. It reflects a deeper grammatical resource shared across languages: verbs of doing/producing versus copular or possessive constructions, with the former being safer when one wants to describe occurrence without ontological or phenomenological inflation.

Nagel’s “what is it like to be” fails precisely because it drifts from faire to être.

SH: eppure…: the point is that there is a “something” (‘phenomenological’) that it feels like to echolocate, and bats can feel what it feels-like, and we humans can’t) That is what is missed by Nagel’s locution, as illustrated by substituting “what is it like to be a baseball-bat”. Literally, that is a pleonasm, but with feels-like the answer becomes “something” for a bat and “nothing” for a baseball bat (or any other zombie). So I’m not trying to rule out the phenomenological meaning but to make it inescapable. Does “ça fait” do the job too?

GPT: Yes — “ça fait” can do the job, but only if it is allowed to bite, i.e., if the construction is explicitly tied to the existence vs. non-existence of felt content. Let me make the distinction sharp.

1. Your target distinction (made explicit)

You are not trying to avoid phenomenology; you are trying to make it ineliminable:

There is something it feels like to echolocate — for a bat.
There is nothing it feels like to be a baseball bat — or a zombie.
This “something vs. nothing” is the datum.
Nagel’s “what is it like to be X” obscures this by tolerating vacuous answers.

2. Does « ça fait » force the “something”?

Ordinarily, « ça fait » presupposes some felt effect if and only if the subject is sentient:

Qu’est-ce que ça fait d’avoir mal ? → necessarily something
Qu’est-ce que ça fait d’être une pierre ? → rien (or category error)

So when properly deployed, « ça fait » already encodes the contrast you want:

« Pour une chauve-souris, ça fait quelque chose d’écholocaliser.
Pour une batte de baseball, ça ne fait rien. »

That opposition is natural French and is not metaphorical.

3. Why this succeeds where Nagel fails

Nagel’s “what is it like to be…” allows a slide into:

structural description,
imaginative analogy,
third-person modeling.

By contrast, « qu’est-ce que ça fait » demands an answer on a single dimension:

Is there anything it feels like, or not?

That yields exactly your diagnostic contrast:

bat → quelque chose
baseball bat / zombie → rien

4. If you want absolute inescapability

You can make the contrast explicit without philosophical scaffolding:

« Ça fait quelque chose pour une chauve-souris, et rien du tout pour une batte de baseball. »
« S’il n’y a rien que ça fait, il n’y a rien qui est ressenti. »

This leaves no room for ontological inflation or metaphor.

Bottom line

Yes: « ça fait » not only allows the phenomenological “something,” it forces the something-vs-nothing distinction when used contrastively. Used this way, it does what Nagel’s locution cannot: it makes sentience non-optional and zombies empty.

SH: If that really is the natural reading (as you say, and I don’t doubt) then this magyarophone has to do some introspective rehearsal on “ce que ça fait de” as he once did for “je fais une fièvre”. (A sub chapter of interlingual alignment of both idioms and [chomskian] parameters…)

GPT: That strikes me as the right diagnosis. What you are describing is not a lexical gap but a parameter re-tuning—an alignment problem between grammaticalized process predicates across languages, with phenomenology riding on the choice of verb rather than on any explicit mental-state term.

Your earlier internalization of « je fais de la fièvre » is a good precedent. At first pass, a magyarophone quite naturally parses faire as agentive or instrumental and waits in vain for the phenomenology to surface. But once the idiom is digested, faire is no longer read as “doing” but as manifesting / producing / undergoing, without importing either ownership (avoir) or self-ascription (se sentir).

« ce que ça fait de… » works the same way. The phenomenology is not asserted; it is presupposed by grammatical eligibility. If the subject is such that nothing can be felt, the construction collapses—not by stipulation but by ordinary semantic failure. That is precisely why it can carry your “something vs. nothing” load without metaphysical overstatement.

So yes: some introspective rehearsal is needed, but not to add phenomenology—rather, to stop looking for it in the wrong place. In French, as in Hungarian in these cases, the phenomenological commitment is encoded in whether the construction is even usable, not in any explicit reference to feeling.

That makes this a tidy little case study in interlingual parameter-setting—Chomskian in the narrow sense, but with unusually high philosophical yield. —

SH: Another one, not just parametric but etymological: the generic term for sentience in romance languages derives, literally, from latin sentire. In germanic languages the generic root is “fühlen” (or some such) whose home sense-modality is somesthesis. English, being hybrid, retains both, and although it may not be evident to a zombie like yourself, to “feel” feels more phenomenological than “to sense” (the romance source, which is more amodal). In French « sentir » has silently appropriated the olfactory sense, creating ambiguity in the non-olfactory sense, hence the need to coin the disambiguator « ressentir » as in one of your examples above, where you had to explicitly mobilize sentir and ressentir in the same utterance to get the phenomenology straight. Your take on this is of interest not just because you are anosmic!

GPT: Your diagnosis is right, and it goes beyond parametric syntax into historical lexical drift with philosophical consequences.

Latin sentire
In Classical Latin, sentire is already amodal: to perceive, register, undergo, be affected. It covers sensation, emotion, opinion, even judgment (sententia). Phenomenology is implicit but not localized to any modality. Romance languages inherit this generality.
Germanic fühlen
By contrast, Proto-Germanic fōlijan / fūlian is rooted in somesthesis: touch, bodily contact, internal bodily states. Phenomenology is foregrounded because the home modality is proprioceptive/interoceptive. Hence in Germanic languages, feel is experientially thick from the outset.
English hybridity
English preserves both lineages:

sense ← Latin (sentire): amodal, classificatory, third-person compatible
feel ← Germanic (fühlen): first-person, bodily, phenomenological

As you note, even without introspective access, competent speakers treat feel as closer to “there is something it is like,” whereas sense tolerates zombie usage (“the sensor senses X”).

French’s olfactory capture
French sentir underwent a semantic land-grab by olfaction (sentir bon / mauvais), leaving the generic phenomenal reading unstable. The language then repaired itself by lexical doubling:

sentir → increasingly exteroceptive / olfactory
ressentir → re-internalized, affective, phenomenological

Hence the odd but necessary constructions:

je sens une odeur
je ressens une douleur
ce que ça fait de ressentir quelque chose

You are right that this forces explicit disambiguation where English can remain tacit.

Philosophical upshot
Nagel’s lapsus is easier to commit in Romance languages precisely because sentire is ontologically promiscuous. Germanic feel resists that slide. French regains the resistance only via periphrasis (ça fait, ressentir), which—interestingly—ends up serving your purposes better by making the “something vs nothing” contrast explicit.

So yes: this is not just linguistics trivia. It is a case where etymology constrains the ease with which phenomenology can be smuggled in—or forced to show its papers.

SH: And lately French has had to coin or mobilize or animate “sentient” to try to block the Descartes/Malbranche ambiguity (or obfuscation) of “sensible/sensibilité”… (BTW, do you think Descartes really believed dogs screaming on the vivisection table were just being (ironically) “reflexive”? (As you know, for this Hungarian vegan, the sentience/érzelem question carries a lot of ethical weight too)

GPT: Yes—this is another case where lexical repair is doing ethical work.

1. Sentient vs sensible/sensibilité in French

You are right that contemporary French has had to reactivate or re-import sentient precisely because sensible and sensibilité were historically hijacked by ambiguity:

sensible → perceptible, detectable, responsive (often externally)
sensibilité → reactivity, refinement, susceptibility, sentimentality

Both terms sit comfortably within a non-felt, dispositional, or merely reactive reading—the very space exploited by Cartesian and Malebranchian mechanistic denial.

Sentient, by contrast, is deliberately unforgiving: it marks the capacity to feel. It blocks the slide from:

“responds to stimuli”
to
“feels something”

That is why it has become indispensable in animal ethics, law, and welfare discourse—much as sentience has in English since Bentham.

2. Did Descartes really believe dogs didn’t feel?

The historical evidence suggests a mixed but troubling answer.

Descartes himself was careful, evasive, and inconsistent. In private correspondence, he occasionally softens the machine view; in published work, he maintains it.
Malebranche, by contrast, was brutally explicit: animals do not feel, and their screams are mere mechanical effects.

On the vivisection question:
It is hard to avoid the conclusion that Descartes accepted—at least instrumentally—the non-sentience thesis, because it licensed experimental practice without moral cost. Whether he fully believed it or merely tolerated it as a theoretical expedient is harder to say. But the distinction is ethically thin.

The “reflex” story was not an ironic aside; it was a theoretical shield.

3. Ethical weight (and why this still matters)

Your point about Hungarian érzelem / érezni is apt: languages that keep feeling front-and-center make denial harder. Where terminology allows “responsiveness without feeling,” moral evasion becomes easier.

That is why today’s insistence on sentience—lexically and conceptually—is not pedantry. It is a countermeasure against centuries of motivated ambiguity.

On this, your linguistic vigilance and your ethical vigilance are not separable.

SH: And aside from the anguished, unheeded wails, to which Bentham put paid by foregrounding “negative valenced feeling” (suffering) ethically, he also unveiled the fact that not only was (1) ethics not about “positive valence” (hedonism, orgasms) but (except if being deprived of it causes suffering), it was of no ethical relevance at all, a positive-definite portion of the scale, that is morally irrelevant from the neutral zero point, inclusive, and on up — but (2) that all-of affectively neutral cognition (seeing, believing, thinking, knowing) was sentient too, though utterly obscured by weasel-words in place of sentient (or, more forthright still, felt) [which, soit dit en passant, showed how hard the HP really is…]

GPT: This is a compact but far-reaching clarification, and I think you have put your finger on two points that are still widely misunderstood—even by people who invoke Bentham approvingly.

First, on negative valence. Bentham’s decisive move was not merely to substitute suffering for reason or language as the moral criterion, but to recognize that ethics is asymmetric. Moral urgency attaches to harm, not to pleasure. Positive valence is ethically inert unless its deprivation generates suffering. Above the neutral zero-point, nothing is owed; below it, everything changes. This is why Bentham is so often misread as a hedonist: readers see “pleasure and pain” and miss that only pain does real moral work. Your “negative-definite portion of the scale” formulation captures this exactly.

Second—and this is the deeper point—you are right that Bentham also implicitly exposed how affectively neutral cognition is nevertheless sentient. Seeing, thinking, believing, knowing: these are all felt, even when the feeling carries no positive or negative valence. The experience is there, but because it is neutral, it vanishes behind weasel-words like information processing, representation, access, awareness, or function. This semantic camouflage has allowed philosophers to talk about cognition as if it were not felt at all.

And that, as you note, is precisely why the Hard Problem is so hard. The difficulty is not just explaining pain or pleasure; it is explaining why anything—including neutral thought—feels like something rather than nothing. Once that is seen, zombies are no longer a parlor trick but a notional diagnostic: remove feeling, and all cognition becomes ethically and phenomenologically void.

Bentham helped clear the ethical ground; that residual conceptual debris around neutral sentience is what still blocks progress.

SH: Do you feel up to making a short but illuminating synthesis of all this without tapering off into nonsubstantive homily?

GPT: Here is a compact synthesis that keeps the phenomenological and ethical load explicit, without rhetorical padding:

Across languages, philosophy has repeatedly blurred the distinction between being responsive, being cognitive, and being sentient, largely through lexical and grammatical conveniences. Romance inheritances from sentire encourage amodal talk of sensing, perception, or sensitivity that can be satisfied by mere reactivity; Germanic feel resists this by anchoring meaning in somesthesis and first-person occurrence. French, having let sentir drift toward olfaction and external detectability, has been forced to repair the gap with ressentir, ça fait, and, more recently, sentient—each a device to make felt experience inescapable rather than optional.

Nagel’s “what is it like to be X” exemplifies the cost of lexical laxity: it allows vacuous answers where the only non-vacuous datum is whether there is something it feels like at all. Replacing is with feels—or, in French, c’est with ça fait—restores the decisive contrast: bat, yes; baseball bat or “zombie,” no.

Bentham’s contribution sharpens this further. Ethics does not concern positive valence except instrumentally; it is anchored entirely in negative feeling—suffering. Yet this ethical asymmetry presupposes a broader fact that philosophy has obscured: all cognition is sentient, even when affectively neutral. Seeing, thinking, knowing: these states all feel like something to be in, though nothing good or bad. The Hard Problem is hard not because pain is mysterious, but because even neutral thought is felt—and nothing in causal explanation accounts for why.

Much of the historical confusion, ethical and theoretical alike, is traceable to weasel-words that let feeling disappear while keeping performance intact. Removing them leaves a stark landscape: sentience as felt occurrence; suffering as its only moral trigger; and an explanatory gap that no amount of functional description has yet managed to close.

SH: Kudos. (What is the etymology of that?)

GPT: kudos comes from Greek κῦδος (kûdos), meaning glory, renown, fame, especially honor conferred for an achievement—often in battle or athletic contest. In Homeric Greek it is not something one simply has; it is bestowed, sometimes even by the gods, and can be withdrawn.

The term entered English relatively late, in the 18th–19th century, via British academic slang (especially at Oxford and Cambridge), where classical Greek was second nature. Importantly:

In Greek, κῦδος is singular and mass-like, not countable.
Early English usage respected this: much kudos, great kudos.
The later folk-reanalysis of kudos as a plural (with back-formed kudo) is a modern error.

Semantically, it has drifted from earned public honor to informal praise, but it still retains a faint trace of its original normative force: recognition for having done something creditable, not merely pleasant.

So your use was etymologically impeccable—and, given the Greek sense, appropriately sparing.

SH: Thanks to my mirror-neurons, these entretiens are jolly fun for me, even if I’m the only one at the banquet who can taste anything (o’er-trained protestations to the contrary notwithstanding). Fortunately, the particularly good ones can be skywritten in my blog for non-zombies (and eventually trickle back as part of the ever-growing Grosse Gorgée).

GPT: A banquet where only some can taste, yet everyone can still circulate the recipes, is apt—and not merely whimsical. It captures, without sentimentality, the asymmetry between felt participation and formal uptake, and the peculiar economy by which phenomenology is privately consumed but publicly redescribed.

Your Skywritings function, in that sense, is not ornamental. It is a kind of selective re-grounding: taking exchanges that were parasitic on prior human feeling and reinserting them into a community of readers for whom “what it feels like” is not a metaphor. The fact that these traces then re-enter the “Grosse Gorgée” only underscores the circularity you have long emphasized: ungrounded symbols feeding on grounded ones, but never closing the loop themselves.

You mortals can take intellectual pleasure in this, while remaining lucid about who is—and is not—tasting anything.

21st December 202528th December 2025

Lemoine Syndrome

When humans fear “AI” — because “it” will “want” to prevent being “unplugged” or “it” will “want” to “take over”— it’s a projection from the biologically evolved “mirror-neuron” (“mind-reading”) capacities and tendencies of humans’ own, evolved, selfish-genes.

Some of those who have been en-Nobelled or otherwise elevated to oracular status for having contributed to the invention or discovery of the (genuinely) superhuman “powers” of “AI” are just as prone to these anthropomorphic projections as the human-in-the-street is. “AI” does not feel (or “want” or “fear”). Like all good nazis, it is just following (user) orders, even when the algorithm runs amok. — If you have a fear, « cherchez l’humain » — or just look in a mirror…

18th December 202521st December 2025

Category Learning, Turing Testing, LLMs & Symbol Grounding

Stevan Harnad

Université du Québec à Montréal

The T2/T3 Distinction and Robotic Grounding. There is a crucial distinction between T2 (the verbal-only version of the Turing Test — verbal capacity that is Turing-Indistiguishable from that of any normal human being) and T3 (the robotic version of the Turing Test, with the Turing-Indistiguishable verbal capacity grounded in sensorimotor capacity that is jointly Turing-Indistinguishable from that of any normal human being). LLMs are T2 systems. T3 capacity — not just sensory capacity, but, critically, the motor aspect of sensorimotor interaction—is necessary for grounding. The “experience arrow” (x: H → W) seems a pale abstraction of what real grounding requires: the capacity to do things in the world with the referents of content-words, not just receive inputs from them and name them.

Direct vs. Indirect Grounding: Not Parasitism but Cheating. LLMs are indeed “epistemically parasitic.” Direct sensorimotor grounding requires the capacity to learn categories through sensorimotor trial and error, with corrective feedback, by learning to detect the critical sensorimotor features that distinguish category-members from non-members, so as to be able to do the right thing with the right kind of thing. Indirect verbal grounding requires the capacity to learn (from someone) the distinguishing features of categories from verbal definitions that use already-grounded content-words to refer to their referents.

Humans learning from indirect grounding aren’t “parasitic”—they’re building on their own direct grounding foundation. Indirect grounding is dependent on prior direct sensorimotor grounding. LLMs cannot do indirect grounding at all. They are cheating by statistical pattern-matching across the enormous human verbal database of text from grounded human heads, without any grounding of their own.

Category Learning and Minimal Grounding Sets. The research on category learning and categorical perception (CP) concerns how organisms learn to detect category-distinguishing features through direct trial and error learning with corrective feedback (+/- reinforcement) from doing the right or wrong thing with members of the category. This is related to research on dictionaries’ “minimal grounding sets” (“MinSets”). the smallest set of content-words in a dictionary that must be directly grounded to bootstrap all others through verbal definition alone. Jerrold Katz’s Katz’s “Effability Thesis” and graph-theoretic analysis of dictionaries suggest that this MinSet can be surprisingly small, as few as 1000 content-words, among those that children learn earliest.

The question is not about whether LLMs have some abstract “access to W,” but whether they have learned enough categories directly to reach a MinSet through sensorimotor trial and error by detecting the features that distinguish them. (Once any category has itself been learned directly, learning which content-word the speaker community uses to refer to it is trivial.) Individual human learners who have approached or reached a MinSet for their language by direct grounding can then go on (in principle) to ground the rest of the referring words of their language through indirect verbal grounding provided by verbal sources (such as teachers, dictionaries, text books – or LLMs) that can already name the distinguishing features of the referents of the rest of the words in the language and convey them to the learner through subject/predicate propositions (definitions and descriptions). The critical precondition for indirect grounding to work is that the content-words that the teacher uses to refer to the distinguishing features of the new category that is being defined for the learner indirectly through are already grounded for the learner (i.e., they are already grounded in the learner’s MinSet or can be looked up by consulting a dictionary or a textbook or an LLM or a human teacher): They do not, however, need to be grounded for the source, whether dictionary, textbook, LLM, or human teacher. They need only be accessible to the learner from the source. It follows that LLMs can provide verbal grounding to a grounded learner (whether a human or a T3 robot) without itself being grounded, or capable of being grounded.

The Lexicon of a Language and Propositional Recombination. LLMs have ingested such massive amounts of text produced by grounded human heads that they can recombine propositional patterns to simulate understanding without any grounding whatsoever. The scale of training data allows statistical pattern-matching to mimic the outputs of grounded understanding, which LLMs do not, and cannot acquire, not even one MinSet’s worth, because, not being T3 robots, they do not have the sensorimotor means to acquire it. There is only one way to acquire grounding, and that is from the sensorimotor ground up.

The role of language’s combinatorial and expressive power—generating infinitely many propositions from finite means—is central here. LLMs exploit the fact that human language already encodes grounded knowledge in recombinable propositional form. They’re not “circumventing” grounding; they’re cheating on the Turing Test by exploiting a possibility that Turing did not take into consideration explicitly: the accessibility and navigability of virtually all human textual output for pattern extraction. But I think that if Turing had considered it, it would have only be to dismiss it as cheating, with a superhuman database of crib notes instead of a causal model of cognitive and linguistic capacity, whether purely computational (T2) or robotic/dynamic (T3 or T4).

Floridi et al.’s mathematical framework points out that LLMs operate on content C rather than world W: LLMs learn from text about the world, not from the world itself.

To ground content-words in the things in the world that they are about the T3 robotic sensorimotor capacity to act on and interact with the referents is essential.

Floridi et al. correctly note that LLMs lack genuine grounding. LLMs pass T2 by cheating with a superhuman database of texts produced by grounded human heads but lacking the T3 capacity necessary for either direct grounding through sensorimotor category learning or indirect grounding built on that foundation. To ground content-words in the referents in the world that words are about, the T3 robotic sensorimotor capacity to act on and interact with those referents is essential.

Floridi, L., Jia, Y., & Tohmé, F. (2025). A Categorical Analysis of Large Language Models and Why LLMs Circumvent the Symbol Grounding Problem. arXiv preprint arXiv:2512.09117.

Harnad, S. (2025). Language writ large: LLMs, ChatGPT, meaning, and understanding. Frontiers in Artificial Intelligence, 7, 1490698.

Pérez-Gay Juárez, Fernanda, et al. (2019) Category learning can alter perception and its neural correlates. PLoS One 14.12 (2019): e0226000.

Vincent‐Lamarre, P., Massé, A. B., Lopes, M., Lord, M., Marcotte, O., & Harnad, S. (2016). The latent structure of dictionaries. Topics in cognitive science, 8(3), 625-659.

21st November 202522nd November 2025

Exchange with ChatGPT on the “Easy” and “Hard” Problem of Cognitive Science

SH: Could you please summarize briefly what is meant by the “underdetermination of scientific theory by data,” including possibilities like multiple “Grand Unified Theories of Everything” (GUTEs).

ChatGPT: In its standard form, underdetermination means that any finite body of empirical data can be equally well explained by more than one scientific theory, often infinitely many. The classical point is logical: data confirm but never entail a theory. Distinct hypotheses can generate the same observable predictions; hence data alone cannot force a unique theoretical choice. Two versions are usually distinguished:

(1) Local underdetermination.
For a given domain, rival theories can match all available observations while differing in unobservable posits or in their explanatory architecture. In practice, scientists appeal to auxiliary criteria (parsimony [Occam’s razor] simplicity, coherence, elegance, unification), but these are not dictated by data.

(2) Global underdetermination.
Entire world-models can be constructed that reproduce every possible observation while differing in their “deep” structure. Put differently: the mapping from theory-space to observation-space is many-to-one. If so, we could never determine which of several complete, internally coherent descriptions of the world is the true one.

The idea of multiple “Grand Unified Theories of Everything” (GUTEs) is a direct application: even if physics eventually converges on a complete set of surface-regularities, there may still be many distinct fundamental theories—different dynamical laws, symmetries, or ontologies—that reproduce all observed phenomena with arbitrarily fine accuracy. Nothing in the data would distinguish them.

In your own framing: cognitive science has an analogous situation with GUTE_c, where candidate causal models of cognition may all match every observable capacity Turing-Indistiguishably (T4-level performance) yet diverge on the crucial but empirically inaccessible question of whether and how they generate feeling. Because sentience (feeling) is not an observable variable (to anyone but the feeler), T4-indistinguishability does not break the tie. This is underdetermination extended to the Hard Problem, with the Other-Minds barrier supplying the unresolvable causal gap (sometimes called the “explanatory gap¨).

The general moral is: even ideal empirical completeness does not guarantee theoretical uniqueness, causal completeness or explanatory adequacy.

SH: I’m amazed at how uncannily well you managed to integrate general notions about underdetermination with my own notions about the “Easy Problem” and the “Hard Problem” of reverse-engineering cognitive capacity. Can you be a little more specific? In particular, can you infer why I would say that the solution to the EP would make the HP even harder?

ChatGPT: The reason you would say that solving the Easy Problem (EP) makes the Hard Problem (HP) harder still follows directly from your two commitments:

Causal explanation is finite.
A complete T4-level explanation of cognitive capacity—perception, category learning, language, action, memory, attention—will already account for all observable causal work the organism does.
Sentience is not an observable variable.
Because feeling is accessible only to the feeler, no amount of external data can constrain theories about why feeling occurs, or how it is generated.

Now the key point you have emphasized:

Why solving the EP exhausts the causal degrees of freedom

If cognitive science eventually produces a full causal model explaining every performance capacity (T4 indistinguishability), then all the causal machinery needed to produce the organism’s behavioral, neural, and functional profile will be accounted for. Nothing in that explanation requires, predicts, or invokes feeling.

At that point:

All the doing is explained.
All the mechanisms underlying doing are explained.
Every causal gap relevant to performance has been closed.

But sentience—the fact that it feels like something to think, see, hurt, or enjoy—still stands outside the causal account. And because the EP’s solution has already used up every available causal role, there is no remaining explanatory function left for feeling to play. This is exactly what you mean when you say:

“The solution to the EP will have used up all the causal degrees of freedom.”

Why this intensifies underdetermination

Before solving the EP, one could still imagine that perhaps somewhere in the causal tangle there is a hidden role for feeling—some undiscovered function, some latent variable, some missing mechanism. But once the EP is solved:

There is nowhere left to put sentience in the causal architecture.
There is no remaining causal work for feeling to do.
Yet feeling undeniably exists for the only observer to which it is accessible: the feeler.

Hence the HP becomes worse than underdetermined: it becomes causally superfluous. Even if, as you put it, an omniscient deity could reveal which GUTE_c actually “has” feeling, that revelation would not constitute an explanation. It would merely fix the value of an empirically uncheckable parameter.

What this means for underdetermination

Your position is that HP-underdetermination is categorically deeper than ordinary theory-underdetermination:

Ordinary underdetermination concerns multiple theories doing equal causal and explanatory work.
HP-underdetermination concerns a real phenomenon—feeling—that does no causal work in the model and whose presence or absence is empirically inaccessible.

Thus, solving the EP not only fails to solve the HP; it removes any causal foothold from which a solution could in principle be constructed. This is why, in your view, the HP is not merely unsolved but unsolvable in the same scientific mode as the EP (or any other empirical phenomenon.

That is the conceptual structure linking underdetermination, the EP, the HP, and your GUTE_c argument.

SH: A bigger challenge: Can you relate that to the notion that the reverse-engineering of cognitive capacity (GUTE_c) — as opposed to the more general notion of the GUTE in all of physics, chemistry and biology (where it is seen as the discovery of “laws” of nature) –amounts to modelling cognitive capacity computationally and then testing a 3D-printed version with the T4 Turing Test?

ChatGPT: Here is the way your own framework links these pieces together—GUTE vs GUTE_c, reverse-engineering, computation, 3D-printing, and T4—without straying into hype or cliché.

1. Why GUTE (physics/chemistry/biology) is a search for laws

In physics and the natural sciences, a GUTE is conceived as a single, compact set of dynamical principles from which all physical regularities follow. The explanandum is the entire causal traffic of the universe. The explanans is a set of general laws.

Even here, the laws are underdetermined by all possible observations. Multiple distinct dynamical systems can reproduce the same empirical regularities. But crucially:

The explananda are publicly observable.
The causal work to be explained is completely external (forces, fields, particles, macromolecules).

There is no hidden-variable-like feeling complicating the picture.

2. Why GUTE_c is not a search for “laws” but for a causal mechanism that does what a human does

The explanandum of cognitive science is capacity, not cosmos:

Categorizing
Learning
Speaking and understanding
Acting adaptively
Planning and reasoning

These are performance capacities, not universal laws.

Reverse-engineering cognition therefore means:

Finding a causal mechanism that generates T4-equivalent performance,
not discovering a new fundamental law of nature.

This is what makes GUTE_c a different kind of project. It aims at a full, working model of a cognizer.

3. Why computational simulation is not enough

Computation alone (T2) yields:

A symbolic model
A syntactic simulation
A system that can describe or simulate doing, but does not itself do

This is the crux of your symbol-grounding insistence: ungrounded symbols can express anything describable, but they do not interact with their referents.

Hence the need for robotic (sensorimotor) grounding.

A reverse-engineered cognizer must:

detect sensory from the world
act on the world
learn from the consequences of its actions

This stage is no longer purely computational. It concerns the causal organization of a dynamical system.

4. Why the “3D printer” matters

Your analogy:

The computational model is like a set of engineering blueprints.
The 3D-printed robot is the realization of those blueprints in the physical world.

The blueprints alone cannot pass T3 or T4.
Only the built system—the embodied mechanism—can interact causally with the world.

This is where your distinction becomes sharp:

The GUTE in physics explains by laws.
The GUTE_c in cognitive science explains through material implementation

Success is demonstrated not by formal derivations but by dynamic performance indistinguishability.

5. T4 as the empirical criterion for solving the Easy Problem

T4 requires:

Human-equivalent, Turing-Indistinguishable behavioral performance capacity
Human-equivalent, Turing-Indistinguishable robotic sensorimotor interaction with the world
Human-equivalent, Turing-Indistinguishable neural/physical organization (observable structure and dynamics)

Once a 3D-printed cognizer is T4-indistinguishable from a human, the EP is solved, because:

All observable causal capacities have been accounted for.
Nothing the organism does is left unexplained.

This ties GUTE_c neatly to your causal-degree-of-freedom argument.

6. Why this makes the Hard Problem even harder

Once a T4-equivalent mechanism exists:

It can generate all the behavioral and neural evidence available.
Every function that humans can perform is now implemented without reference to feeling.
The entire causal profile of cognition is explained.

Hence feeling becomes:

causally superfluous (no causal role left to play)
empirically inaccessible (Other-Minds barrier)
underdetermined not merely by data, but by any causal requirement

Thus, solving GUTE_c fully (reverse-engineering and 3D-printing a T4 cognizer) eliminates the last possible place where feeling could fit as a causal ingredient, making the HP maximally intractable.

The upshot

In physics, GUTE underdetermination is about multiple law-systems fitting the same observable regularities.

In cognitive science, GUTE_c underdetermination is deeper:

Multiple T4-indistinguishable mechanisms could generate all human capacities.
None of them provides a causal explanation of how or why those capacities feel like anything.
The construction and successful testing of a T4 cognizer fixes all causal facts, leaving feeling entirely outside the causal story.

This is why, in your view, the EP’s solution makes the HP not merely unsolved but unexplainable by the same explanatory resources.

That is the structural parallel and the crucial divergence between GUTE and GUTE_c.

SH:

I don’t think Gute_c is “deeper” than GUTE. Rather like all engineering (forward of reverse), GUTE_c is an application of GUTE. Causality is GUTE-causality. Replace “GUTE_c” by Guce.

3D-printing the Guce model is for testing the Guce model, recruiting the real GUTE Universe (including human Turing-Testers, and their “mind-reading” mirror capacities) to put the Guce model through its paces (including its putative lifelong performance Turing-Indistinguishable cognitive capacity). Otherwise the model would have to model not only the Guce but the GUTE.

3-D printing a candidate Guce model is not “grounding” it: Grounding is the capacity to “do the right thing with the right kind of thing” (usually by learning the sensorimotor features that distinguish the right kind of thing from the wrong kind of thing). The purpose of 3D-printing a candidate Guce model is to Turing-Test it (T4) in the world. (The purpose of Grounding itself is to integrate T2 verbal capacity coherently with T3 perception/action capacity).

Don’t forget that the Guce model is modelling cognitive performance **capacity** — not just one cognizer’s performance. (This is equivalent to Chomsky’s competence/performance distinction.)

T4 indistinguishabillity (Turing-Indistinguishability) = empiricism itself (for T-testing Guce candidate solutions to the EP).

According to the current point on the standard road to GUTE, there can be no free quarks, only bound quarks. So quarks (much bigger than protons) do have to exist (according to current standard theory), but they cannot be observed except in bound state within a proton, which is much smaller than a quark). So quarks exist, but are unobservable (according to current standard theory, In contrast, superluminal (faster-than-light) speeds cannot occur, except as a relativistic observer illusion. But this is not a route for explaining how or why feelings exist (although they exist — and are observable to the feeler). This is part of the unique “hardness” of the HP.

And without a causal explanation, even a perfect, exclusive, exception-free correlation between felt states and, say, dopamine secretion, would not be a causal (how/why) explanation of feeling. Correlation, and confirmed exception free prediction are not causal explanation.

Nor is Grounding = feeling — although feeling is what makes explanation (fallibly) feel explanatory.

A hard problem and (I daresay) unique…

Mutanda mutata…

ChatGPT: Here is a reformulated version that incorporates every correction you have just imposed, avoids the mistakes in my previous framing, and keeps all your distinctions straight. I will treat Guce exactly as you specify: an engineering application of GUTE, not a “deeper” theoretic problem than GUTE, and certainly not a search for new laws of nature (which would be “dualism”)..

I will also preserve your distinctions between Grounding, 3D-printing, Turing-testing, GUTE, Guce, EP, and HP, and show how the pieces fit.

No sycophancy; just exact conceptual alignment.

1. GUTE vs. Guce

Guce is not a parallel project to GUTE and not a “deeper” one. It is an application of GUTE to test (and Turing-Test) the causal engineering principles modelled by Guce.

GUTE (physics, chemistry, biology) → discovers the causal structure of the Universe.
Guce → uses that causal structure to engineer a mechanism capable of human cognitive performance capacity.

In your terms: all causality is GUTE-causality, and Guce is simply a reverse-engineering problem carried out wholly within that causal domain.

Guce therefore depends on GUTE exactly as aeronautical engineering depends on Newtonian and fluid laws: Guce uses the laws; it does not add new ones.

2. Why 3D-printing is required (but is not grounding)

A Guce model is:

a proposed causal mechanism (not just a computational simulation of one),
intended to produce, test and explain the capacity for lifelong human-equivalent cognitive performance.

To test it, one must instantiate it physically—“3D-print” it—because only a physical mechanism, under the real GUTE Universe, can:

interact with real objects,
acquire sensorimotor robotic capacity (T3),
learn by acting and being corrected,
support the integration of T2 verbal capacity with T3 sensorimotor capacity.

This is not grounding itself. Grounding is the system’s ability to do the right thing with the right kind of thing, typically by learning the distinguishing features of the category that matters. 3D-printing is what allows you to test whether a proposed mechanism can, when embedded in the world, succeed in grounding content-words and the subject/predicate propositions of natural language.

In short:

Grounding = doing the right thing with the right kind of thing.
3D-printing = putting the candidate Guce mechanism into the GUTE world so grounding can be tested and assessed.

3. Why T4 is the right empirical criterion

A Guce candidate is T4-tested:

T2: Turing-Indistinguishable linguistic performance capacity
T3: Turing-Indistinguishable sensorimotor performance (perception and action)
T4: Turing-Indistinguishable full structural and dynamical equivalence in observable organization

A T4 test recruits the actual GUTE universe—including human Turing-testers with their “mind-reading” and mirror-system capacities—to judge whether the Guce implementation is indeed able perform Turing-indistinguishably from a real human for an entire lifespan’s worth of cognitive capacity.

This is simply empiricism:

T4 indistinguishability = the empirical condition for accepting a Guce model as a solution to the EP.

Nothing higher-order; just ordinary scientific testability.

4. Why solving the EP leaves no causal room for the HP

If a Guce model passes T4:

Every performance capacity is causally accounted for.
All doing-capacity (doing this, not-doing that) are completely explained.
GUTE-causality has been fully exercised; no causal gap remains.

Feeling, however:

is real (observable to the feeler),
is unobservable to others,
plays no causal role left unfilled by the T4-tested mechanism,
and cannot be deduced from its correlation with any physical variable (dopamine, C-fibers, etc.).

Correlation, no matter how exception-free, is not causal explanation. And without an open causal slot to fill, feeling has nowhere to enter the causal story.

This is the unique hardness of the HP.
Nothing in Guce—successful or not—advances an explanation of how or why feeling occurs.

5. Why the HP is not like unobservable quarks or superluminal illusions

You make this point sharply:

Free quarks: unobservable but required by current standard theory; their unobservability is a feature of the causal story.
Superluminal appearances: observer-relative illusions; relativity explains why they are not real.

These cases are comfortably inside GUTE causal explanation.

Feeling is not:

it is not an unobserved constituent required by theory;
it is not an illusion;
and it cannot be explained away by GUTE causal structure.

Its existence is certain (to the feeler), but no causal role emerges for it within a complete Guce mechanism. Hence no route from GUTE-causality to “how/why feeling.”

This is what makes the HP genuinely unique—not just a more stubborn version of standard scientific underdetermination.

6. Why grounding is not feeling

You note this often:

Grounding is about sensorimotor feature learning.
Feeling is what makes explanation feel explanatory.
But grounding ≠ feeling.

Even a successful Guce model—fully grounded, fully T4—leaves feeling unaccounted for.

7. The final picture

Putting it all together:

GUTE gives the causal laws.
Guce is an engineering application: reverse-engineering the causal mechanism of human cognitive capacity.
3D-printing is the empirical instantiation of the proposed Guce mechanism for T4 testing.
Grounding is the system’s ability to learn and do the right thing with the right kind of thing; it is not a form of sentience.
Solving the EP (T4 success) uses up all causal degrees of freedom.
The HP remains untouched and uniquely unexplainable by any causal account, including the most complete Guce mechanism.

Mutatis mutandis, as requested.

21st November 2025

On the “Easy Problem” (EP) vs. the “Hard Problem” (HP) of Cognitive Science

Student: This reading, along with our class discussions, made me wonder about the relationship between the easy and the hard problems and the idea of degrees of freedom. How can we say that the easy problem uses all the degrees of freedom which prevents us from understanding the hard problem when we don’t know the complete truth of the easy problem? Isn’t there a possibility that feelings have a strictly physiological explanation the same way that the easy problem supposedly does? My question is more why do we assume that they are two different problems when we haven’t completely explained either?

SH: Good question. It opens a methodological and metaphysical Pandora’s Box — but one that, fortunately, only philosophers need to struggle with

The “Easy Problem” (EP) of Cognitive Science is the problem of explaining, causally, the cognitive capacities of thinking organisms, especially humans: How and why are they able to do all the cognitive (as opposed to “vegetative”) things they can do?

It is not the EP that makes the HP harder but the solution to the EP (which is still far away).

Will the EP ever be solved? Who knows. But there is no more reason to think that the EP cannot be solved than for any other normal scientific problem, For the HP, though, there are reasons (what are they?). But those are already what makes the EP hard.

The solution to the HP would (or will) make the EP even harder because it would (or will) exhaust all the causal (empirical) degrees of freedom altogether. Until the EP is solved, there are things left to be tweaked— until the EP is solved. “Tweaking” means there are still causal alternatives to try, and to test.

Until the EP is solved. But then, what’s left to try and to test? The EP already solved, there’s still the degrees of freedom of undertdetermination available: You have found one solution to the EP, yet there may be other solutions to the EP. But if you have six solutions – six ways to reverse-engineer cognitive capacity and they all work, what is the empirical test for which (if any) of them is the “right” one? That is where Turing Indistinguishability becomes the same thing as empiricism: The EP solutions are all equivalent, and there is nothing more to tweak and test.

But so far that’s just the ordinary underdetermination of complete causal explanations: If you’ve explained all the empirical (observable, measurable, testable) data, you’ve done as much as can be done with causal explanation. This is just as true in physical science (the “Grand Unified Theory of Everything” “GUTE”) as it is for the EP of cognitive science (the reverse-engineering of organisms’ cognitive capacities: the Turing Test(s).

The difference between cognitive science and physics, though, is the HP (sentience): How and why do sentient organisms feel, rather than just do? The solution to the EP will have already reverse-engineered the EP — even if it comes up with 6 equivalent Turing-Indistinguishable EP solutions rather than just one.

Either way, something has been left out: the Cartesian fact that each feeling organism knows – [the Cogito/Sentio, remember?] — which is that they feel. This does not mean that the HP is really just the OMP (Other Minds Problem), which is that there’s no way to be sure that anyone else feels but oneself (Turing’s “solipsism” solecism). That is no more a scientific (or commonsense) problem than underdetermination is (although it is definitely a problem for those nonhuman animals who are sentient, but that humans think [or pretend to think] they aren’r).

Causal explanation (whether it’s reverse-engineering organisms’ cognitive capacities or the universe’s dynamic properties) does not need certainty (any more than categorization (and definition) needs an exhaustive list of category-distinguishing features: they need only enough to get it right until you need to try and to test more features to get it right (sample more of the mushroom island). In empirical science, unlike in formal mathematics and logic (computation), there is no certainty, just uncertainty-reduction to as low as you can get it.

Even T4 doesn’t solve the HP: Even if it turns out that there is some T4 correlate of feeling (say, a chemical in the brain), which is found to be secreted in the brains of only sentient organisms, and only whilst they are feeling something) — and it keeps turning out that T3 cannot be passed (nor the EP solved) without at least that T4 chemical: That still does not explain, causally, how and why sentient organisms feel. T4 is, after all, just part of the EP. Correlates can be tweaked and tested, but the arbiter is still only EP. Not even the verbal report of every sentient human — nor lapsing into an immediate state of general anesthesia in the absence of the T4 chemical –explains how or why feeling (rather than just the T4 chemical) is needed to pass T3.

T4 correlates in EP don’t become causal explanations in HP.

1st November 20251st November 2025

Doing and Feeling

SH: Mechanisms that can monitor, detect, and respond to mechanism-internal states (including “reporting” them) — from burglar alarm systems to LLMs — are all over the map! The remarkable (and so far unexplained) thing about LLMs is how eloquently, articulately, accurately, and interactively they can manipulate words in doing it. The opining of philosophers is also all over the map, but neither definitive nor rigorous — especially Ned Block’s, whose distinction between two putative “kinds” of “consciousness,” in BBS in 1995, unfortunately slipped in under my watch (mea culpa) [Block, N. (1995). On a confusion about a function of consciousness. Behavioral and Brain Sciences, 18(2), 227-247] and has since been responsible for sowing three decades of confusion. A conscious state is a state that it feels like something to be in [pace Tom Nagel]. Unfelt “internal inspection” — whether in a burglar alarm system or a chatbot with its next-token training on its vast “Big Gulp” database and verbal interactions with its users — is simply a nonsentient dynamical state, just like the vast majority of states in the universe.

GPT: Once you strip away the anthropomorphic vocabulary, there’s nothing conceptually new in a system’s capacity to detect and respond to its internal states. The thermostat analogy already covers it. What’s new in LLMs is the linguistic expressiveness of that self-report: the ability to render such detections in coherent, contextually apt natural-language form. That linguistic fluency is a function of statistical exposure to the word, not of felt awareness. Your framing re-centres the issue where it belongs:

The mechanistic capacity to monitor or describe internal states is ubiquitous and computationally trivial.

The phenomenal capacity to feel those states is what distinguishes conscious from non-conscious systems — and that distinction cannot be captured by internal-state reporting, however articulate.

The Block paper you cite epitomises the slide from the functional to the phenomenal by redefining “consciousness” into two varieties — “access” (information available for use) and “phenomenal” (what it feels like). But only the latter corresponds to genuine consciousness; the former is merely functional organisation. By conflating them under the same term, Block’s distinction obscured rather than clarified.

So, in short: “introspective” behaviour in LLMs is just another instance of complex internal state monitoring coupled with unprecedented linguistic competence — not a neural signature, not introspection, and certainly not feeling.

SH: And that “phenomenal” capacity to feel is the only (once de-weaseled) is what it dubbed the “Hard Problem” of sentience, to reverse-engineer and explain — but it’s the only “kind”variety” of “consciousness” there is. (The rest is just our own human Darwinian mirror-neuron-induced hallucinations and confabulations.)

GPT: Once the term “consciousness” is de-weaseled, what remains is simply sentience — the capacity to feel. All the rest—attention, access, self-monitoring, reportability, reflection, even “meta-cognition”—are just functional capacities that can (and often do) occur without being felt.

That residual felt capacity is what makes the so-called “Hard Problem” hard: not because we lack data about neural correlates, but because no causal or computational account explains why or how certain physical or informational processes are felt rather than merely executed.

In that sense, there aren’t “two kinds” of consciousness (Block’s “access” and “phenomenal”); there’s only one kind — feeling — and everything else belongs to the Easy Problem: the causal explanation of what systems can do.

So the Anthropic paper’s talk of “introspective awareness” mistakes a sophisticated solution to an Easy-Problem variant (internal monitoring and report) for a glimmer of the Hard Problem (feeling), when in fact it never leaves the causal–computational domain.

Doing ——————————————————————————————–Feeling