Bipko Digital News & Media Platform

collapse
Home / Daily News Analysis / The five-star review system is broken, exhibit #472,304.

The five-star review system is broken, exhibit #472,304.

May 31, 2026  Twila Rosenbaum  7 views
The five-star review system is broken, exhibit #472,304.

The five-star review system, once a simple tool for consumer feedback, has become a source of frustration for developers and a misleading metric for users. The latest exhibit in this ongoing saga comes from Terry Godier, creator of the RSS reader Current. In a recent post, Godier highlighted an unavoidable problem: in the current system, anything below a perfect five-star rating is often perceived as a catastrophe, leading to a paradox where even glowing four-star reviews can drag down an app's overall score.

The Problem with Four Stars

Godier's observation is backed by data. On both the Apple App Store and Google Play, users frequently leave four-star ratings accompanied by comments like 'This is my favorite app!' or 'Gamechanger!' Yet because the average rating is computed arithmetically, these well-intentioned reviews actually lower the score. For an app with a perfect 5.0 rating, a single four-star review drops the average to 4.8 or below, depending on the total number of ratings. Over time, a cascade of such 'good but not perfect' scores can push a high-quality app below 4.0, making it appear mediocre to new users. This phenomenon is not limited to small developers; even major apps like Instagram and YouTube have seen their ratings fluctuate due to this inflation bias.

Psychology of Review Inflation

The root cause lies in the psychology of rating systems. Users often reserve five stars for a subjective 'perfect' experience, while four stars are used to indicate 'very good.' But the platform treats both as quantifiable metrics, and a four is simply not a five. Compounding this is the fact that many users only bother to rate when they are either extremely happy or extremely upset. Disgruntled users are more likely to give a one-star rating, while satisfied customers often give four or five stars without much thought. This creates a U-shaped distribution where the extremes are overrepresented, and the nuanced middle ground is lost. Developers then face a choice: beg for five-star reviews to maintain a high average, or accept that even excellent work will be dinged by users who think 'nothing is perfect.'

Impact on Small Developers

For indie developers like Godier, the stakes are even higher. A new app launching with a handful of reviews can be severely impacted by a single four-star rating. In the highly competitive app marketplace, a drop from 5.0 to 4.5 can mean the difference between being featured by the store and being buried in search results. Moreover, users often filter by rating, setting a minimum threshold of four stars or higher. An app hovering at 3.9 stars might be invisible to a significant portion of potential downloaders, even if the ratings it does have are overwhelmingly positive. This creates a system where developers are incentivized to game the algorithm—requesting only five-star reviews, hiding rating prompts until after a user has had a flawless experience, or even buying fake positive reviews. All of this degrades trust in the entire ecosystem.

Historical Context

The five-star system has its origins in the early days of e-commerce, popularized by Amazon in the late 1990s. It was meant to provide a quick visual summary of customer sentiment. However, as digital marketplaces grew, the limitations became apparent. Studies have shown that rating distributions are often bimodal, with peaks at one and five stars. This is especially true for emotional purchases or subjective experiences like apps and games. In contrast, review systems that use simpler binary feedback (thumbs up/down) or continuous scales (like Netflix's old star system prior to 2017) tend to produce more accurate representations of user satisfaction. The Netflix example is particularly instructive: the company moved away from stars because users were not rating based on objective quality but on their personal enjoyment, which varied widely. Similarly, app ratings are often proxies for user mood, not app quality.

Alternatives and Potential Solutions

Several alternatives have been proposed to fix the broken star system. One is to use a 'net promoter score' (NPS) approach, where users are asked simply if they would recommend the app, on a scale of 0 to 10. This metric is then categorized into detractors, passives, and promoters, providing a more nuanced picture. Another idea is to adopt a 'thumbs up/thumbs down' system, similar to YouTube or Reddit, which reduces the granularity but also reduces the cognitive load on users. A third option, already implemented by some platforms like TripAdvisor, is to display the median rating instead of the average, which is less sensitive to outliers. However, changing the rating system on a platform as large as the App Store is a massive undertaking, and Apple and Google have shown little interest in overhauling their decades-old design. For now, developers like Godier are left to cope with the broken system as best they can.

Developer Strategies

In response to these challenges, many developers have adopted proactive strategies. They include in-app prompts that ask users to rate after a positive interaction, such as completing a task or reaching a milestone. Some developers also add a 'request review' button that appears only after a session of above-average engagement. Others have turned to external review platforms like Product Hunt or dedicated subreddits to gather more authentic feedback that doesn't impact the App Store rating. Godier himself has noted that he might need to ask users to leave a rating only if they feel the app deserves five stars, a tactic that feels manipulative but is necessary for survival. The burden should not be on developers to manage their ratings so carefully; the system itself should be improved.

The Broader Implications

The five-star review problem extends beyond mobile apps. It affects everything from hotel bookings to online courses to consumer products. In every case, the same dynamics apply: a few low ratings can outweigh many high ones, and the nuance of user experience is lost in a crude numerical average. As consumers, we rely on these ratings to make decisions, but we are often misled. A product with a 4.5-star average might actually have a significant number of dissatisfied customers, while a 3.8-star product might be excellent but penalized by a few haters. The only way to truly assess quality is to read the actual reviews, which most users don't have time to do. This is why the system is broken, and why it needs a fundamental rethink.

Terry Godier's post about Current is just the latest reminder of a systemic flaw. His app, which offers a clean and powerful RSS reading experience, is receiving praise in the text of reviews but is being penalized in the numbers. Until app stores adopt a more sensible rating scheme, developers will continue to fight an uphill battle against the tyranny of five stars.


Source: The Verge News


Share:

Your experience on this site will be improved by allowing cookies Cookie Policy