MetaTune: Meta-Learning Based Cost Model for Fast and Efficient Auto-tuning Frameworks
AI-generated Key Points
- Deep learning compiler frameworks are becoming more popular for deep learning applications on diverse hardware
- They can match the performance of target-specific libraries while reducing overhead costs
- MetaTune is a new paper proposing a meta-learning based cost model that predicts performance of optimized codes with pre-trained model parameters
- MetaTune encodes convolution kernel codes as structurally similar graphs to facilitate meta-learning and then meta-trains a GNN model with a small input data set
- The resulting framework provides 8 to 13% better inference time on average for four CNN models with comparable or lower optimization time while outperforming transfer learning by 10% in cross-platform cases
- Accuracy of the cost model is essential in locating ideal optimization parameters; many auto-tuning frameworks adopt machine-learning based cost models that can dynamically adapt to search results
- MetaTune offers an efficient solution to optimize deep learning applications across different hardware platforms without sacrificing performance or increasing overhead costs associated with auto-tuning frameworks.
Authors: Jaehun Ryu, Hyojin Sung
Abstract: Deep learning compiler frameworks are gaining ground as a more portable back-end for deep learning applications on increasingly diverse hardware. However, they face the daunting challenge of matching performance offered by hand-tuned target-specific libraries. While auto-tuning frameworks with statistical cost models can provide dynamic and efficient code optimization, they suffer from large space exploration and cost model training overheads. This paper proposes MetaTune, a meta-learning based cost model that more quickly and accurately predicts the performance of optimized codes with pre-trained model parameters. MetaTune encodes convolution kernel codes as structurally similar graphs to facilitate meta-learning, meta-trains a GNN model with a very small input data set, and then predicts optimization parameters for unseen convolution operations with varying sizes and structures during compilation. The resulting framework with MetaTune provides 8 to 13% better inference time on average for four CNN models with comparable or lower optimization time while outperforming transfer learning by 10% in cross-platform cases.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.