A Philosophical Introduction to Language Models -- Part I: Continuity With Classic Debates
AI-generated Key Points
- LLMs can reproduce information and creatively combine patterns to produce original outputs
- The flexible generalization capabilities of LLMs in semantic spaces contribute to their efficiency and resilience compared to rule-based systems
- The paper introduces LLMs to philosophers, emphasizing the importance of engaging with fundamental questions about artificial intelligence
- It discusses the origins of LLMs in early AI research and the divide between symbolic and stochastic approaches in natural language processing
- Noam Chomsky's transformational-generative grammar and the development of rule-based syntactic parsers are highlighted as influences on LLMs
- Renewed debates surrounding artificial neural networks, deep learning advancements, and the success of LLMs are explored
- The significance of behavioral evidence from benchmarks and targeted experiments in shaping discussions about LLMs is emphasized
Authors: Raphaël Millière, Cameron Buckner
Abstract: Large language models like GPT-4 have achieved remarkable proficiency in a broad spectrum of language-based tasks, some of which are traditionally associated with hallmarks of human intelligence. This has prompted ongoing disagreements about the extent to which we can meaningfully ascribe any kind of linguistic or cognitive competence to language models. Such questions have deep philosophical roots, echoing longstanding debates about the status of artificial neural networks as cognitive models. This article -- the first part of two companion papers -- serves both as a primer on language models for philosophers, and as an opinionated survey of their significance in relation to classic debates in the philosophy cognitive science, artificial intelligence, and linguistics. We cover topics such as compositionality, language acquisition, semantic competence, grounding, world models, and the transmission of cultural knowledge. We argue that the success of language models challenges several long-held assumptions about artificial neural networks. However, we also highlight the need for further empirical investigation to better understand their internal mechanisms. This sets the stage for the companion paper (Part II), which turns to novel empirical methods for probing the inner workings of language models, and new philosophical questions prompted by their latest developments.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.