Federated Learning of Deep Networks using Model Averaging
AI-generated Key Points
⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.
- Federated learning is a decentralized approach to training machine learning models on mobile devices
- Modern mobile devices have access to privacy-sensitive and large quantities of data that can improve user experience
- Conventional approaches for training models require logging data in a centralized data center, which is challenging for privacy-sensitive or large datasets
- Federated learning allows the training data to remain distributed on the mobile devices themselves, with a shared model learned by aggregating locally-computed updates from each device
- This approach enables high-quality models to be trained in relatively few rounds of communication, addressing a key constraint in federated learning
- The paper presents a practical method for federated learning of deep networks that is robust to unbalanced and non-IID data distributions commonly found on mobile devices
- Parameter averaging over updates from multiple clients produces surprisingly good results, even when optimizing non-convex loss functions
- The proposed method significantly reduces communication needed to train an LSTM language model by two orders of magnitude
- Federated learning offers an alternative approach for training machine learning models on mobile devices while addressing privacy concerns and large datasets, with promising results in terms of model quality and communication efficiency.
Authors: H. Brendan McMahan, Eider Moore, Daniel Ramage, Blaise Agüera y Arcas
Abstract: Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image models can automatically select good photos. However, this rich data is often privacy sensitive, large in quantity, or both, which may preclude logging to the data-center and training there using conventional approaches. We advocate an alternative that leaves the training data distributed on the mobile devices, and learns a shared model by aggregating locally-computed updates. We term this decentralized approach Federated Learning. We present a practical method for the federated learning of deep networks that proves robust to the unbalanced and non-IID data distributions that naturally arise. This method allows high-quality models to be trained in relatively few rounds of communication, the principal constraint for federated learning. The key insight is that despite the non-convex loss functions we optimize, parameter averaging over updates from multiple clients produces surprisingly good results, for example decreasing the communication needed to train an LSTM language model by two orders of magnitude.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
⚠The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.