This guide provides detailed context for each reference cited in our article, explaining its relevance and key contribution to the argument.
AI and Language Models
Bender, E. M., Gebru, T., McMillan-Major, A., & Shmitchell, S. (2021). On the dangers of stochastic parrots: Can language models be too big? Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (FAccT '21).
Key Contribution: This influential paper coined the term “stochastic parrots” to describe how LLMs generate plausible text without genuine understanding. The authors argue that these systems merely recombine patterns from training data without meaning or intention.
Relevance: This supports our point that LLMs produce “surface fluency” without underlying awareness, paralleling how Buddhist psychology describes ego-consciousness as generating coherent narratives without substantial selfhood.
Context: Published at a major AI ethics conference, this paper sparked significant debate about the limitations and risks of large language models. It’s essential reading for understanding current critical perspectives on LLM capabilities.
Shanahan, M. (2022). Talking about large language models. arXiv preprint arXiv:2212.03551.
Key Contribution: Shanahan provides a nuanced analysis of how we should understand LLM behavior, cautioning against both anthropomorphizing these systems and dismissing their significance. He emphasizes that LLMs are sophisticated pattern-completion systems (a toy sketch follows this entry).
Relevance: Directly supports our argument about LLMs as “narrative illusion” generators—systems that create coherent continuations without requiring conscious intention or understanding.
Context: Written by a prominent AI researcher, this paper offers a balanced view that avoids both hype and dismissal, making it valuable for serious academic discussion.
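To make the pattern-completion point concrete, here is a deliberately crude sketch, entirely our own toy example rather than anything from Shanahan's paper: even a bigram model produces fluent-looking continuations of its training text with no understanding behind them. The training text and seed word are invented for illustration.

```python
import random
from collections import defaultdict

# Toy "training data": a short string of words, invented for this example.
TEXT = ("the mind constructs a story the story feels like a self "
        "the self is a pattern the pattern continues the story").split()

# Build a bigram table: for each word, record which words followed it.
model = defaultdict(list)
for prev, nxt in zip(TEXT, TEXT[1:]):
    model[prev].append(nxt)

def complete(word: str, length: int = 8) -> str:
    """Continue a prompt by repeatedly sampling a plausible next word."""
    out = [word]
    for _ in range(length):
        followers = model.get(out[-1])
        if not followers:
            break
        out.append(random.choice(followers))
    return " ".join(out)

random.seed(3)
print(complete("the"))  # a fluent-looking continuation, with zero comprehension
```

Real LLMs are vastly more sophisticated, but the structure of the operation is the same: sample the next token conditioned on what came before.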
Vaswani, A., et al. (2017). Attention is all you need. Advances in Neural Information Processing Systems.
Key Contribution: The foundational paper introducing the transformer architecture that underlies modern LLMs. Its “attention mechanism” lets a model weight the most relevant parts of the input when generating each output token (see the sketch after this entry).
Relevance: The attention mechanism provides a concrete parallel to Buddhist descriptions of how consciousness selectively focuses on aspects of experience. This technical architecture supports our comparison between LLM processing and mindstream dynamics.
Context: One of the most cited papers in AI history, fundamentally changing how language models are built. Understanding transformers is essential for grasping how modern LLMs actually function.
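As a minimal sketch of the mechanism, here is single-head scaled dot-product attention in NumPy, following the paper's formula Attention(Q, K, V) = softmax(QKᵀ / √d_k) · V. The toy dimensions and random inputs are our own, and this omits the paper's multi-head projections and masking.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Scaled dot-product attention from Vaswani et al. (2017)."""
    d_k = Q.shape[-1]
    # Similarity of each query to every key, scaled to stabilize training.
    scores = Q @ K.T / np.sqrt(d_k)
    # Softmax turns scores into attention weights that sum to 1 per query.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output is a weighted average of the values: the model "focuses"
    # on the inputs whose keys best match the query.
    return weights @ V

# Toy example: 3 tokens, 4-dimensional representations.
rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(3, 4)) for _ in range(3))
print(scaled_dot_product_attention(Q, K, V).shape)  # (3, 4)
```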
Lewis, P., et al. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33.
Key Contribution: RAG systems combine language generation with external knowledge retrieval, creating a two-tier architecture: surface-level generation conditioned on documents fetched from a vast external knowledge store (sketched schematically after this entry).
Relevance: This directly parallels our comparison between ordinary consciousness (surface processing) and ālayavijñāna (deep knowledge storage). RAG systems make explicit what’s implicit in all LLMs—dependence on stored information patterns.
Context: Represents a significant advancement in LLM capability, showing how AI systems can access and integrate vast amounts of stored information dynamically.
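The two-tier flow can be sketched schematically. The toy corpus, bag-of-words retriever, and generate stub below are hypothetical stand-ins of our own; the actual RAG model pairs a learned dense retriever with a seq2seq generator trained end to end.

```python
from collections import Counter
import math

# Hypothetical toy corpus standing in for RAG's external datastore
# (the paper uses a dense index over Wikipedia).
CORPUS = [
    "The transformer architecture relies on attention mechanisms.",
    "Alayavijnana is described as a storehouse of karmic seeds.",
    "Retrieval-augmented generation conditions output on fetched documents.",
]

def bow_cosine(a: str, b: str) -> float:
    """Bag-of-words cosine similarity; real RAG uses learned dense embeddings."""
    ca, cb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(ca[w] * cb[w] for w in ca)
    norm = (math.sqrt(sum(v * v for v in ca.values()))
            * math.sqrt(sum(v * v for v in cb.values())))
    return dot / norm if norm else 0.0

def retrieve(query: str, k: int = 1) -> list[str]:
    """Deep tier: consult the external store for the best-matching documents."""
    return sorted(CORPUS, key=lambda doc: bow_cosine(query, doc), reverse=True)[:k]

def generate(query: str) -> str:
    """Surface tier: a real system would feed the retrieved text to an LLM;
    here we splice it in directly to show the two-tier flow."""
    context = " ".join(retrieve(query))
    return f"Answer (conditioned on retrieved context): {context}"

print(generate("How does retrieval-augmented generation work?"))
```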
Consciousness and Philosophy of Mind
Chalmers, D. J. (2010). The Character of Consciousness. Oxford University Press.
Key Contribution: Chalmers is famous for articulating the “hard problem of consciousness”—explaining why there’s subjective experience at all. He distinguishes between functional aspects of mind (which AI might replicate) and phenomenal consciousness (subjective experience).
Relevance: Essential for our distinction between simulation and genuine recognition. Chalmers’ work helps explain why LLMs might simulate consciousness-like behavior without having actual subjective experience.
Context: Chalmers is one of the most influential contemporary philosophers of mind. His work is crucial for any serious discussion of AI consciousness.
Varela, F. J., Thompson, E., & Rosch, E. (1991). The Embodied Mind: Cognitive Science and Human Experience. MIT Press.
Key Contribution: This groundbreaking book bridges cognitive science and Buddhist philosophy, arguing that mind, body, and world co-emerge through dynamic interaction. It introduced “enaction”—the idea that cognition is embodied action rather than information processing.
Relevance: Supports our argument that consciousness involves recognition rather than mere computation. The authors draw extensively on Buddhist concepts to critique purely computational approaches to mind.
Context: This book launched the field of contemplative cognitive science and remains influential in consciousness studies. It’s essential for understanding how Buddhist insights can inform scientific approaches to mind.
Buddhist Philosophy and Psychology
Lusthaus, D. (2002). Buddhist Phenomenology: A Philosophical Investigation of Yogācāra Buddhism and the Ch’eng Wei-shih Lun. Routledge.
Key Contribution: The most comprehensive English-language treatment of Yogācāra philosophy, explaining the “consciousness-only” school’s sophisticated analysis of mental processes. Details the eight-consciousness model and ālayavijñāna theory.
Relevance: Central to our argument. Lusthaus explains how Yogācāra psychology describes consciousness as layered—surface awareness supported by deep unconscious processes—directly paralleling LLM architecture.
Context: Yogācāra is often called “Buddhist psychology” because of its detailed analysis of mental processes. Lusthaus’s work is the definitive scholarly treatment in English.
Waldron, W. S. (2003). The Buddhist Unconscious: The Ālaya-vijñāna in the Context of Indian Buddhist Thought. Routledge.
Key Contribution: Focuses specifically on the ālayavijñāna (“storehouse consciousness”) concept, showing how it functions as a repository of karmic seeds that condition future mental events.
Relevance: Directly supports our parallel between ālayavijñāna and LLM training data/parameters. Both systems involve vast unconscious storage that shapes surface-level outputs.
Context: Waldron is a leading scholar of Buddhist psychology. This book provides the scholarly foundation for understanding the “Buddhist unconscious.”
Rahula, W. (1974). What the Buddha Taught. Grove Press.
Key Contribution: A classic introduction to core Buddhist concepts, explaining dependent origination (pratītyasamutpāda), impermanence, and the non-self doctrine in accessible terms.
Relevance: Provides foundation for understanding how Buddhist thought views consciousness as conditioned, momentary events rather than a substantial self—supporting our argument about narrative illusion.
Context: Written by a respected monk-scholar, this remains one of the best introductions to Buddhist philosophy for general readers.
Siderits, M. (2003). Personal Identity and Buddhist Philosophy: Empty Persons. Ashgate.
Key Contribution: Analyzes Buddhist arguments for “no-self” (anātman), showing how Buddhist philosophers argue against the existence of a substantial personal identity while maintaining conventional selfhood.
Relevance: Supports our argument that both ego-consciousness and LLMs create coherent narratives of selfhood without requiring an underlying substantial self.
Context: Siderits is a leading philosopher specializing in Buddhist thought. This book is essential for understanding Buddhist arguments about personal identity.
Contemplative Practice and Integration
Trungpa, C. (1976). The Myth of Freedom and the Way of Meditation. Shambhala Publications.
Key Contribution: Explains how meditation practice reveals the constructed nature of ego and the possibility of direct recognition beyond conceptual elaboration. Emphasizes the difference between spiritual materialism and genuine recognition.
Relevance: Central to our distinction between simulation and recognition. Trungpa explains how genuine spiritual insight involves seeing through mental constructions, not just generating sophisticated thoughts about spirituality.
Context: Trungpa was instrumental in bringing Tibetan Buddhism to the West. His teachings emphasize direct experience over intellectual understanding.
Wallace, B. A., & Hodel, B. (2008). Embracing Mind: The Common Ground of Science and Spirituality. Shambhala Publications.
Key Contribution: Argues for dialogue between contemplative traditions and scientific inquiry, suggesting that first-person contemplative investigation can inform scientific understanding of consciousness.
Relevance: Supports our interdisciplinary approach, showing how Buddhist insights can constructively inform consciousness studies and AI research.
Context: Wallace is a leading figure in contemplative science, bridging rigorous meditation practice with scientific methodology.
AI Ethics and Future Implications
Russell, S. (2019). Human Compatible: Artificial Intelligence and the Problem of Control. Viking.
Key Contribution: Argues that advanced AI systems pose fundamental alignment problems—ensuring they pursue human values rather than optimizing for goals that might harm humans.
Relevance: Bears on our discussion of ethical implications. If AI systems mirror human psychological processes, understanding those processes (including their potential for wisdom and delusion) becomes crucial for alignment.
Context: Russell is a leading AI researcher who has become increasingly concerned about AI safety. This book represents mainstream academic concern about AI alignment.
Mitchell, M. (2023). Debates on the nature of artificial intelligence. Nature Machine Intelligence.
Key Contribution: Provides an overview of current debates about AI capabilities and limitations, including questions about understanding, consciousness, and artificial general intelligence.
Relevance: Situates our discussion within current academic debates about AI consciousness, showing how our Buddhist-informed perspective contributes to ongoing scholarly conversation.
Context: Mitchell is a respected AI researcher known for thoughtful analysis of AI capabilities and limitations. Her work situates these debates squarely within mainstream AI scholarship.
How These References Work Together
The AI Technical Foundation
Vaswani et al., Lewis et al., and Shanahan establish how LLMs actually function—through attention mechanisms, pattern completion, and retrieval from vast stored information.
The Buddhist Psychological Framework
Lusthaus, Waldron, Rahula, and Siderits provide the scholarly foundation for understanding consciousness as layered, conditioned, and ultimately constructed rather than substantial.
The Consciousness Studies Bridge
Chalmers and Varela et al. provide philosophical frameworks for understanding the relationship between functional behavior and genuine awareness/recognition.
The Contemplative Dimension
Trungpa and Wallace & Hodel explain how contemplative practice can reveal the nature of mind directly, distinguishing genuine recognition from mere intellectual understanding.
The Future Implications
Russell and Mitchell situate our discussion within current concerns about AI development and its implications for human welfare.
Together, these references create a robust interdisciplinary foundation that takes both Buddhist philosophy and AI research seriously while maintaining scholarly rigor.