ReviewGap: Identifying Documentation Gaps Before They Become Tickets
The Problem
Support teams know the frustration: users submit tickets for issues that should be covered in documentation but aren't, or are buried where no one finds them. Meanwhile, documentation teams don't always know which gaps are causing the most pain until the ticket queue tells them (too late).
What I Built
ReviewGap is a Python tool that analyzes product reviews to surface recurring complaints that documentation could address. Instead of waiting for tickets to pile up, it identifies patterns in customer feedback and prioritizes which support articles would have the most impact.
How It Works
- Ingests review data from product feedback sources
- Filters out noise: general complaints, shipping issues, and other non-documentation problems
- Clusters similar issues, so you see "15 users struggled with password reset" instead of 15 separate complaints
- Prioritizes by impact: frequency matters, but critical issues like security or data loss get flagged immediately
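The steps above can be sketched roughly as follows. This is an illustration, not ReviewGap's actual code: the keyword lists, the `Gap` class, and the function names are all hypothetical, and simple keyword matching stands in for whatever clustering the tool really uses.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass

# Hypothetical keyword lists, chosen only for illustration.
NOISE_TERMS = ("shipping", "delivery", "refund", "price")
CRITICAL_TERMS = ("security", "data loss", "hacked", "deleted")

# Hypothetical topics; keyword matching is a stand-in for real clustering.
TOPICS = {
    "password reset": ("password", "reset"),
    "export": ("export", "csv"),
    "account security": ("security", "hacked"),
}


@dataclass
class Gap:
    topic: str
    count: int
    critical: bool


def is_noise(review: str) -> bool:
    """Return True for reviews that documentation could not address."""
    text = review.lower()
    return any(term in text for term in NOISE_TERMS)


def topic_of(review: str) -> str | None:
    """Assign a review to the first topic whose keywords it mentions."""
    text = review.lower()
    for topic, keywords in TOPICS.items():
        if any(keyword in text for keyword in keywords):
            return topic
    return None


def find_gaps(reviews: list[str]) -> list[Gap]:
    """Filter noise, group reviews by topic, and rank the groups."""
    counts: Counter = Counter()
    critical: set = set()
    for review in reviews:
        if is_noise(review):
            continue
        topic = topic_of(review)
        if topic is None:
            continue
        counts[topic] += 1
        if any(term in review.lower() for term in CRITICAL_TERMS):
            critical.add(topic)
    gaps = [Gap(topic, n, topic in critical) for topic, n in counts.items()]
    # Critical topics come first; within each group, most frequent first.
    return sorted(gaps, key=lambda gap: (not gap.critical, -gap.count))
```

Sorting on `(not gap.critical, -gap.count)` puts a single security complaint ahead of a more frequent but less serious topic, which matches the "flagged immediately" rule in the last step.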
Results
In testing against real review data, ReviewGap cut analysis time from over an hour to about four minutes while producing cleaner, more actionable output.
Why This Matters for Support
Good documentation reduces ticket volume. But knowing what to document, and what's missing, usually requires digging through support history manually. This tool automates that discovery process.
What I Learned
The first version captured everything, which made the output overwhelming and not particularly useful. Adding relevance filtering taught me that raw data isn't insight: you need to design for the right data. That same principle applies to troubleshooting: knowing what to ignore is as important as knowing what to investigate.
Link
The GitHub repo is here: Review Gap