CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities

AI-generated keywords: Large language models, Interaction design, Generative capabilities, CoAuthor dataset, GPT-3

AI-generated Key Points

  • Large language models (LMs) in interaction design
  • Proposal of curated datasets to examine generative capabilities
  • Introduction of CoAuthor dataset for exploring GPT-3's abilities
  • Interactions between 63 writers and four instances of GPT-3 across 1445 writing sessions
  • Surveys conducted after each session to assess capabilities and limitations of LMs
  • Insights into GPT-3's language generation, ideation, and collaboration capabilities
  • Principled discussion around promises and pitfalls of LMs in interaction design
  • GPT-3 generates fluent text with fewer errors than human writers
  • GPT-3 provides new ideas to writers and can collaborate effectively with them
  • CoAuthor dataset is publicly available for further analysis

Authors: Mina Lee, Percy Liang, Qian Yang

Published as a conference paper at CHI 2022
License: CC BY 4.0

Abstract: Large language models (LMs) offer unprecedented language generation capabilities and exciting opportunities for interaction design. However, their highly context-dependent capabilities are difficult to grasp and are often subjectively interpreted. In this paper, we argue that by curating and analyzing large interaction datasets, the HCI community can foster more incisive examinations of LMs' generative capabilities. Exemplifying this approach, we present CoAuthor, a dataset designed for revealing GPT-3's capabilities in assisting creative and argumentative writing. CoAuthor captures rich interactions between 63 writers and four instances of GPT-3 across 1445 writing sessions. We demonstrate that CoAuthor can address questions about GPT-3's language, ideation, and collaboration capabilities, and reveal its contribution as a writing "collaborator" under various definitions of good collaboration. Finally, we discuss how this work may facilitate a more principled discussion around LMs' promises and pitfalls in relation to interaction design. The dataset and an interface for replaying the writing sessions are publicly available at https://coauthor.stanford.edu.

Submitted to arXiv on 18 Jan. 2022

Results of the summarizing process for the arXiv paper: 2201.06796v2

This paper discusses the use of large language models (LMs) in interaction design and proposes curating datasets to examine their generative capabilities. The authors present CoAuthor, a dataset designed to explore GPT-3's abilities in creative and argumentative writing. The dataset captures interactions between 63 writers and four instances of GPT-3 across 1445 writing sessions. A survey was conducted after each session to assess the capabilities and limitations of LMs, as well as writers' overall experiences. The dataset provides insights into GPT-3's language generation, ideation, and collaboration capabilities, and it supports a more principled discussion of the promises and pitfalls of LMs in interaction design. The paper demonstrates that GPT-3 generates fluent text with fewer errors than human writers, provides new ideas to writers, and can collaborate effectively with them. The CoAuthor dataset is publicly available, which enables further exploration of the potential of large language models in interaction design.
Created on 26 Sep. 2023
