Maven Telemetry Service
Discussion: https://github.com/support-and-care/maven-support-and-care/discussions/199
Original Item: "Maven Telemetry Service (Tamás)"
Problem Statement
The Maven ecosystem lacks reliable, comprehensive usage data. Understanding which plugins, dependencies, Maven versions, and Java versions are most widely used is essential for prioritizing maintenance effort, deprecation decisions, and roadmap planning. Maven Central download statistics provide only a partial picture, as Maven Repository Managers (MRMs) and other artifact sources skew the data significantly.
Previous attempts at collecting IDE usage telemetry (e.g., Eclipse M2E) faced community backlash due to insufficient transparency and consent mechanisms.
Current State / Starting Point
- No centralized, anonymized telemetry exists for Maven CLI usage
- Maven Central provides download counts, but these are skewed by MRM caching and mirroring
- No data on which Maven or Java versions are in active use
- Plugin maintainers have no visibility into which plugin versions are still widely deployed
- Related observability work is discussed in Build Observability and Reporting and Open Tracing Support (#73)
Expected Outcomes / Benefits
- Opt-In Telemetry Collection: A strictly opt-in, privacy-respecting mechanism for Maven to report anonymized usage data, including:
- Maven and Java version combinations in use
- Most used plugins and plugin versions
- Most used dependencies
-
Build environment characteristics (OS, CI vs. local)
-
Aggregated Usage Dashboard: A public dashboard providing the Maven community and maintainers with actionable insights:
- Adoption rates for new Maven versions
- Plugin popularity and version distribution
-
Dependency usage trends
-
Data-Driven Decision Making: Enable the Maven project to make informed decisions about:
- Which plugin versions to maintain or deprecate
- Where to focus compatibility testing
-
Migration progress (e.g., Maven 3 to Maven 4 adoption)
-
Privacy by Design: Ensure community trust through:
- Explicit opt-in (no data collection by default)
- Full anonymization of collected data
- Transparent data collection scope and retention policies
- Open-source telemetry infrastructure
Final Discussion (2026-02-24)
- Discussion:
- Use telemetry standards as far as possible (e.g., OpenTelemetry, Prometheus)
- Propose telemetry infrastructure
- Provide documentation about good practices for telemetry (e.g. with Grafana etc.)
- Decision: Request Funding