Weightless Neural Networks for Efficient Edge Inference

AI-generated keywords: Weightless Neural Networks (WNNs)

AI-generated Key Points

  • Weightless Neural Networks (WNNs) use table lookups for inference, while Deep Neural Networks (DNNs) rely on multiply-accumulate operations.
  • WNN architectures have lower implementation costs than DNNs but typically lag behind in accuracy and have high memory requirements.
  • Recent research has led to significant improvements in WNN performance.
  • The authors propose a novel WNN architecture called BTHOWeN that incorporates algorithmic and architectural improvements over prior work, including counting Bloom filters, hardware-friendly hashing, and Gaussian-based nonlinear thermometer encodings.
  • BTHOWeN is designed specifically for edge computing applications, providing superior latency and energy efficiency compared to comparable quantized DNNs.
  • BTHOWeN reduces error by more than 40% and model size by more than 50% on average when compared with state-of-the-art WNNs across nine classification datasets.
  • An FPGA-based accelerator for BTHOWeN consumes almost 80% less energy than MLP models with nearly 85% reduction in latency.
  • Efficient machine learning on the edge is important, and WNNs offer a promising alternative due to their lower memory requirements and energy consumption.
  • The authors' proposed BTHOWeN architecture represents a significant step forward in the development of efficient WNNs for edge computing applications.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Zachary Susskind, Aman Arora, Igor Dantas Dos Santos Miranda, Luis Armando Quintanilla Villon, Rafael Fontella Katopodis, Leandro Santiago de Araujo, Diego Leonel Cadette Dutra, Priscila Machado Vieira Lima, Felipe Maia Galvao Franca, Mauricio Breternitz Jr., Lizy K. John

License: CC BY 4.0

Abstract: Weightless Neural Networks (WNNs) are a class of machine learning model which use table lookups to perform inference. This is in contrast with Deep Neural Networks (DNNs), which use multiply-accumulate operations. State-of-the-art WNN architectures have a fraction of the implementation cost of DNNs, but still lag behind them on accuracy for common image recognition tasks. Additionally, many existing WNN architectures suffer from high memory requirements. In this paper, we propose a novel WNN architecture, BTHOWeN, with key algorithmic and architectural improvements over prior work, namely counting Bloom filters, hardware-friendly hashing, and Gaussian-based nonlinear thermometer encodings to improve model accuracy and reduce area and energy consumption. BTHOWeN targets the large and growing edge computing sector by providing superior latency and energy efficiency to comparable quantized DNNs. Compared to state-of-the-art WNNs across nine classification datasets, BTHOWeN on average reduces error by more than than 40% and model size by more than 50%. We then demonstrate the viability of the BTHOWeN architecture by presenting an FPGA-based accelerator, and compare its latency and resource usage against similarly accurate quantized DNN accelerators, including Multi-Layer Perceptron (MLP) and convolutional models. The proposed BTHOWeN models consume almost 80% less energy than the MLP models, with nearly 85% reduction in latency. In our quest for efficient ML on the edge, WNNs are clearly deserving of additional attention.

Submitted to arXiv on 03 Mar. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2203.01479v1

Weightless Neural Networks (WNNs) are a type of machine learning model that use table lookups to perform inference, as opposed to Deep Neural Networks (DNNs), which rely on multiply-accumulate operations. While WNN architectures have lower implementation costs than DNNs, they typically lag behind in accuracy for common image recognition tasks and suffer from high memory requirements. However, recent research efforts have led to significant improvements in WNN performance. In this paper, the authors propose a novel WNN architecture called BTHOWeN, which incorporates algorithmic and architectural improvements over prior work. These include counting Bloom filters, hardware-friendly hashing, and Gaussian-based nonlinear thermometer encodings to improve model accuracy while reducing area and energy consumption. BTHOWeN is designed specifically for the growing edge computing sector, providing superior latency and energy efficiency compared to comparable quantized DNNs. The authors demonstrate the effectiveness of BTHOWeN by comparing it against state-of-the-art WNNs across nine classification datasets. On average, BTHOWeN reduces error by more than 40% and model size by more than 50%. The authors also present an FPGA-based accelerator for BTHOWeN and compare its latency and resource usage against similarly accurate quantized DNN accelerators such as Multi-Layer Perceptron (MLP) and convolutional models. The proposed BTHOWeN models consume almost 80% less energy than MLP models with nearly 85% reduction in latency. The paper highlights the importance of efficient machine learning on the edge and suggests that WNNs deserve additional attention in this regard. Algorithmic and hardware improvements have driven rapid increases in DNN accuracies during the past decade; however, their large network sizes require significant computational resources that may not be feasible for edge devices with limited power budgets. In contrast, WNNs offer a promising alternative due to their lower memory requirements and energy consumption. The authors' proposed BTHOWeN architecture represents a significant step forward in the development of efficient WNNs for edge computing applications.
Created on 02 Jun. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.