Pure Transformers are Powerful Graph Learners
AI-generated Key Points
- Standard Transformers can be effective in graph learning without any graph-specific modifications
- The Tokenized Graph Transformer (TokenGT) approach treats all nodes and edges as independent tokens and augments them with token embeddings before feeding them to a Transformer
- With an appropriate choice of token embeddings, TokenGT is theoretically at least as expressive as an invariant graph network composed of equivariant linear layers
- TokenGT outperforms message-passing Graph Neural Networks (GNN) baselines and achieves competitive results compared to other Transformer variants when trained on a large-scale graph dataset (PCQM4Mv2)
- The implementation of TokenGT is available on GitHub
- TokenGT works well with large-scale data and achieves promising results in graph learning both in theory and practice
- The authors explore the capability of TokenGT on the PCQM4Mv2 quantum chemistry regression dataset, which has 3.7 million molecular graphs, using both node and type identifiers in their model
- They also apply kernel attention that approximates the attention computation to linear cost
- The work acknowledges support from the Korea government (MSIT)
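The linear-cost kernel attention mentioned in the key points can be illustrated with a small sketch. The summary does not say which kernel the authors use, so the `elu(x) + 1` feature map below is a stand-in assumption, and the function names are hypothetical. The point is only that, once attention weights are written as a product of feature maps, the key/value summaries can be computed once and reused for every query, which makes the cost linear in the number of tokens.

```python
import math


def feature_map(x):
    # elu(x) + 1: a strictly positive feature map (assumed here for illustration,
    # not necessarily the kernel used by the TokenGT authors).
    return [v + 1.0 if v > 0 else math.exp(v) for v in x]


def linear_attention(Q, K, V):
    """Kernel attention computed in time linear in the number of tokens."""
    n_feat, d_v = len(Q[0]), len(V[0])
    # Summaries over all keys/values, built in a single pass.
    kv = [[0.0] * d_v for _ in range(n_feat)]   # sum_k phi(k) v^T
    k_sum = [0.0] * n_feat                      # sum_k phi(k)
    for k, v in zip(K, V):
        pk = feature_map(k)
        for i, ki in enumerate(pk):
            k_sum[i] += ki
            for j, vj in enumerate(v):
                kv[i][j] += ki * vj
    out = []
    for q in Q:
        pq = feature_map(q)
        norm = sum(a * b for a, b in zip(pq, k_sum))
        out.append([sum(pq[i] * kv[i][j] for i in range(n_feat)) / norm
                    for j in range(d_v)])
    return out


def quadratic_attention(Q, K, V):
    """Same kernel, but with explicit pairwise weights (quadratic cost)."""
    phi_K = [feature_map(k) for k in K]
    out = []
    for q in Q:
        pq = feature_map(q)
        w = [sum(a * b for a, b in zip(pq, pk)) for pk in phi_K]
        total = sum(w)
        out.append([sum(wi * v[j] for wi, v in zip(w, V)) / total
                    for j in range(len(V[0]))])
    return out
```

Both functions return the same values up to floating-point error; only the order of the multiplications differs. Note that this kernel defines its own attention weights rather than reproducing softmax attention exactly.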
Authors: Jinwoo Kim, Tien Dat Nguyen, Seonwoo Min, Sungjun Cho, Moontae Lee, Honglak Lee, Seunghoon Hong
Abstract: We show that standard Transformers without graph-specific modifications can lead to promising results in graph learning both in theory and practice. Given a graph, we simply treat all nodes and edges as independent tokens, augment them with token embeddings, and feed them to a Transformer. With an appropriate choice of token embeddings, we prove that this approach is theoretically at least as expressive as an invariant graph network (2-IGN) composed of equivariant linear layers, which is already more expressive than all message-passing Graph Neural Networks (GNN). When trained on a large-scale graph dataset (PCQM4Mv2), our method coined Tokenized Graph Transformer (TokenGT) achieves significantly better results compared to GNN baselines and competitive results compared to Transformer variants with sophisticated graph-specific inductive bias. Our implementation is available at https://github.com/jw9730/tokengt.
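To make the tokenization step concrete, here is a minimal sketch of turning a graph into a flat token sequence. It assumes a particular layout for each token (features, then two node identifiers, then a type identifier) and uses one-hot vectors as a trivially orthonormal choice of node identifiers. The function name `tokenize_graph`, the fixed type vectors, and the layout are all assumptions for illustration; the authors' implementation at the repository linked above may differ, for example by learning the type identifiers and projecting tokens to the model dimension.

```python
def tokenize_graph(num_nodes, edges, node_feats, edge_feats):
    """Return one token (a flat list of floats) per node and per edge.

    node_feats[v] and edge_feats[i] are lists of equal length.
    edges is a list of (u, v) pairs aligned with edge_feats.
    """
    # Node identifiers: one-hot rows, which are orthonormal by construction
    # (an assumption; other orthonormal vectors would also work).
    ident = [[1.0 if i == j else 0.0 for j in range(num_nodes)]
             for i in range(num_nodes)]
    # Type identifiers mark whether a token is a node or an edge.
    # Fixed here for simplicity; they could be trainable parameters.
    node_type = [1.0, 0.0]
    edge_type = [0.0, 1.0]

    tokens = []
    for v in range(num_nodes):
        # A node token repeats its own identifier in both identifier slots.
        tokens.append(node_feats[v] + ident[v] + ident[v] + node_type)
    for (u, v), feat in zip(edges, edge_feats):
        # An edge token carries the identifiers of its two endpoints.
        tokens.append(feat + ident[u] + ident[v] + edge_type)
    return tokens


# Usage: a path graph 0 - 1 - 2 with 2-dimensional features.
tokens = tokenize_graph(
    num_nodes=3,
    edges=[(0, 1), (1, 2)],
    node_feats=[[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]],
    edge_feats=[[1.0, 0.0], [0.0, 1.0]],
)
```

The resulting sequence has one token per node plus one per edge and can be passed to an unmodified Transformer encoder. Because an edge token's identifiers match those of its endpoint node tokens, attention can in principle recover which tokens are incident to one another without any graph-specific architecture.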