Table-GPT: Table-tuned GPT for Diverse Table Tasks
AI-generated Key Points
- Language models like GPT-3.5 and ChatGPT have impressive capabilities in following diverse human instructions and performing tasks
- Performance in table-related tasks is sub-optimal due to being trained on one-dimensional texts
- A new "table-tuning" paradigm is proposed to further train or fine-tune language models using diverse table tasks synthesized from real tables
- The approach involves the "synthesize-then-augment" method, creating diverse table tasks using real tables for training
- Main steps include sampling a table and task type, synthesizing an instance of the task, and augmenting tasks at different levels
- Two approaches are proposed for synthesizing diverse instances of table tasks: task-diversity and data-diversity
- Real tables from sources like web-tables (Cπ€π‘) and database-tables (Cππ) are used to create various types of table-understanding/augmentation/manipulation tasks
- Examples of synthesized tasks include Table Summarization (TS) and Column Augmentation
- Synthesized tasks aim to improve language models' understanding of two-dimensional table structures using real-world examples
- The synthesis-then-augment approach helps language models better understand and perform various table-related tasks, enhancing their overall performance with relational data structures
Authors: Peng Li, Yeye He, Dror Yashar, Weiwei Cui, Song Ge, Haidong Zhang, Danielle Rifinski Fainman, Dongmei Zhang, Surajit Chaudhuri
Abstract: Language models, such as GPT-3.5 and ChatGPT, demonstrate remarkable abilities to follow diverse human instructions and perform a wide range of tasks. However, when probing language models using a range of basic table-understanding tasks, we observe that today's language models are still sub-optimal in many table-related tasks, likely because they are pre-trained predominantly on \emph{one-dimensional} natural-language texts, whereas relational tables are \emph{two-dimensional} objects. In this work, we propose a new "\emph{table-tuning}" paradigm, where we continue to train/fine-tune language models like GPT-3.5 and ChatGPT, using diverse table-tasks synthesized from real tables as training data, with the goal of enhancing language models' ability to understand tables and perform table tasks. We show that our resulting Table-GPT models demonstrate (1) better \emph{table-understanding} capabilities, by consistently outperforming the vanilla GPT-3.5 and ChatGPT, on a wide-range of table tasks, including holdout unseen tasks, and (2) strong \emph{generalizability}, in its ability to respond to diverse human instructions to perform new table-tasks, in a manner similar to GPT-3.5 and ChatGPT.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.