Real-World Recommender Systems for Academia: The Pain and Gain in Building, Operating, and Researching them [Long Version]

AI-generated keywords: Recommender Systems Research Evaluation Data Quality Experiments

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Challenges faced in research on recommender systems:
  • Non-reproducible research results
  • Dealing with noisy data
  • Determining the optimal number and frequency of recommendations
  • Three research-article recommender systems built by the authors over six years
  • Lack of guidance from existing literature in identifying effective recommendation approaches
  • Difficulties encountered in creating a randomization engine for A/B tests
  • Low data quality affecting bibliometrics calculations and evaluation process
  • Experiments yielding disappointing results and reasons behind them
  • Statistics on researcher interest in recommendation dataset
  • Insights into skill requirements, limitations of existing literature, experimental design issues, data quality concerns, and researcher engagement with recommendation datasets.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Joeran Beel, Siddharth Dinesh

This article is a long version of the article published in the Proceedings of the 5th International Workshop on Bibliometric-enhanced Information Retrieval (BIR)

Abstract: Research on recommender systems is a challenging task, as is building and operating such systems. Major challenges include non-reproducible research results, dealing with noisy data, and answering many questions such as how many recommendations to display, how often, and, of course, how to generate recommendations most effectively. In the past six years, we built three research-article recommender systems for digital libraries and reference managers, and conducted research on these systems. In this paper, we share some experiences we made during that time. Among others, we discuss the required skills to build recommender systems, and why the literature provides little help in identifying promising recommendation approaches. We explain the challenge in creating a randomization engine to run A/B tests, and how low data quality impacts the calculation of bibliometrics. We further discuss why several of our experiments delivered disappointing results, and provide statistics on how many researchers showed interest in our recommendation dataset.

Submitted to arXiv on 01 Apr. 2017

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1704.00156v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper titled "Real-World Recommender Systems for Academia: The Pain and Gain in Building, Operating, and Researching them [Long Version]" by Joeran Beel and Siddharth Dinesh discusses the challenges faced in research on recommender systems. The authors highlight major obstacles such as non-reproducible research results, dealing with noisy data, and determining the optimal number and frequency of recommendations. Over a period of six years, the authors built three research-article recommender systems for digital libraries and reference managers. They conducted extensive research on these systems and share their experiences in this paper. One of the key points discussed is the lack of guidance from existing literature in identifying effective recommendation approaches. The authors also explain the difficulties encountered in creating a randomization engine to run A/B tests for evaluating recommender system performance. Additionally, they address how low data quality affects bibliometrics calculations which further complicates the evaluation process. Furthermore, the paper explores why some of their experiments yielded disappointing results. The authors provide statistics on the level of interest shown by researchers in their recommendation dataset. Overall, this paper provides valuable insights into the challenges faced when building and operating recommender systems for academia. It sheds light on various aspects such as skill requirements, limitations of existing literature, experimental design issues, data quality concerns, and researcher engagement with recommendation datasets.
Created on 05 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.