ADR-012: Priority Scoring System for Documentation Drift Detection

Status

Accepted

Date

2025-01-14

Context

As documentation drift detection scales to larger codebases, we need a systematic way to prioritize which drift issues require immediate attention versus those that can be deferred. Without prioritization, developers face information overload and may miss critical documentation updates while being overwhelmed by low-priority notifications.

Core Problem: Documentation drift detection can identify hundreds of potential issues in a large codebase, but not all issues have equal impact:

High-Impact Issues: Frequently-used APIs with breaking changes that affect many users
Medium-Impact Issues: Internal utilities with moderate usage and non-breaking changes
Low-Impact Issues: Rarely-used code paths with minor changes or comprehensive existing documentation

Current State: The drift detection system (ADR-009) identifies drift issues but lacks prioritization, leading to:

Developer fatigue from too many notifications
Critical issues being missed among low-priority noise
Inefficient documentation maintenance workflows
Difficulty focusing on high-value documentation updates

Strategic Importance: Effective prioritization directly impacts:

Developer productivity and focus
Documentation quality and completeness
User experience with up-to-date documentation
Maintenance efficiency and resource allocation

Decision

We will implement a multi-factor priority scoring system for documentation drift that considers code complexity, usage frequency, change magnitude, documentation coverage, staleness, and user feedback. The system calculates an overall priority score (0-100) and provides actionable recommendations (critical/high/medium/low) to guide developers in addressing the most impactful drift issues first.

Priority Scoring Architecture:

1. Multi-Factor Scoring Model

Purpose: Comprehensive priority assessment using multiple data sources Factors:

Code Complexity (weight: 0.20): Measures cyclomatic complexity and code structure complexity
Usage Frequency (weight: 0.25): Tracks function calls, class instantiations, and imports
Change Magnitude (weight: 0.25): Assesses the scope and impact of code changes
Documentation Coverage (weight: 0.15): Evaluates existing documentation completeness
Staleness (weight: 0.10): Measures time since last documentation update
User Feedback (weight: 0.05): Incorporates user-reported issues and feedback

2. Configurable Weight System

Purpose: Allow projects to tune priority scoring based on their specific needs Capabilities:

Default weights optimized for general use cases
Project-specific weight configuration
Dynamic weight adjustment based on project maturity
A/B testing support for weight optimization

3. Actionable Recommendations

Purpose: Translate priority scores into clear action guidance Recommendation Levels:

Critical (90-100): Immediate attention required, breaking changes affecting many users
High (70-89): Address within current sprint, significant impact on user experience
Medium (40-69): Plan for next sprint, moderate impact
Low (0-39): Backlog item, minimal immediate impact

Implementation Details:

Priority Scoring Interface

interface DriftPriorityScore {
  overall: number; // 0-100
  factors: {
    codeComplexity: number; // 0-100
    usageFrequency: number; // 0-100
    changeMagnitude: number; // 0-100
    documentationCoverage: number; // 0-100
    staleness: number; // 0-100
    userFeedback: number; // 0-100
  };
  recommendation: "critical" | "high" | "medium" | "low";
  suggestedAction: string;
}

interface PriorityWeights {
  codeComplexity: number; // default: 0.20
  usageFrequency: number; // default: 0.25
  changeMagnitude: number; // default: 0.25
  documentationCoverage: number; // default: 0.15
  staleness: number; // default: 0.10
  userFeedback: number; // default: 0.05
}

interface PrioritizedDriftResult extends DriftDetectionResult {
  priorityScore?: DriftPriorityScore;
}

interface UsageMetadata {
  filePath: string;
  functionCalls: Map<string, number>; // function name -> call count
  classInstantiations: Map<string, number>; // class name -> instantiation count
  imports: Map<string, number>; // symbol -> import count
}

Scoring Algorithm

class PriorityScorer {
  calculatePriorityScore(
    drift: DocumentationDrift,
    usageMetadata: UsageMetadata,
    weights: PriorityWeights = DEFAULT_WEIGHTS,
  ): DriftPriorityScore {
    const factors = {
      codeComplexity: this.calculateComplexityScore(drift.codeChanges),
      usageFrequency: this.calculateUsageScore(usageMetadata),
      changeMagnitude: this.calculateMagnitudeScore(drift),
      documentationCoverage: this.calculateCoverageScore(drift.affectedDocs),
      staleness: this.calculateStalenessScore(drift.detectedAt),
      userFeedback: this.calculateFeedbackScore(drift),
    };

    const overall = this.weightedSum(factors, weights);

    return {
      overall,
      factors,
      recommendation: this.getRecommendation(overall),
      suggestedAction: this.generateAction(factors, overall),
    };
  }

  private weightedSum(factors: FactorScores, weights: PriorityWeights): number {
    return (
      factors.codeComplexity * weights.codeComplexity +
      factors.usageFrequency * weights.usageFrequency +
      factors.changeMagnitude * weights.changeMagnitude +
      factors.documentationCoverage * weights.documentationCoverage +
      factors.staleness * weights.staleness +
      factors.userFeedback * weights.userFeedback
    );
  }

  private getRecommendation(score: number): RecommendationLevel {
    if (score >= 90) return "critical";
    if (score >= 70) return "high";
    if (score >= 40) return "medium";
    return "low";
  }
}

Alternatives Considered

Simple Severity-Based Prioritization

Pros: Simple implementation, easy to understand, fast execution
Cons: Too simplistic, ignores usage patterns and real-world impact, doesn't scale
Decision: Rejected - fails to capture the complexity of real-world documentation maintenance

Manual Prioritization Only

Pros: Full developer control, no algorithmic complexity, flexible
Cons: Doesn't scale, high cognitive load, inconsistent prioritization, time-consuming
Decision: Rejected - manual prioritization doesn't scale to large codebases

Single-Factor Scoring

Pros: Simple to implement and understand, fast calculation
Cons: Doesn't capture complexity of real-world scenarios, misses important factors
Decision: Rejected - single-factor approaches are too narrow for comprehensive prioritization

ML-Based Prioritization

Pros: Could learn from historical patterns, adaptive to project-specific needs
Cons: Requires training data, model maintenance overhead, less transparent, harder to debug
Decision: Deferred to future enhancement - current rule-based approach is sufficient and more transparent

Consequences

Positive

Developer Focus: Developers can focus on high-impact drift issues first, reducing cognitive load
Improved Efficiency: Data-driven prioritization improves documentation maintenance efficiency
Prevents Critical Gaps: Helps prevent critical documentation gaps from being overlooked
Scalability: System scales to large codebases without overwhelming developers
Configurability: Project-specific weight configuration allows customization
Actionable Guidance: Clear recommendations help developers make informed decisions

Negative

Additional Complexity: Priority scoring adds complexity to the drift detection system
Performance Impact: Usage metadata collection may impact analysis performance
Algorithm Tuning: Scoring algorithm needs tuning based on real-world usage patterns
Metadata Requirements: Requires usage tracking infrastructure
Weight Configuration: Projects need to understand and configure weights appropriately

Risks and Mitigations

Inaccurate Scoring: Validate scoring accuracy through user feedback and iteration
Performance Impact: Implement efficient usage metadata collection with caching
Weight Misconfiguration: Provide sensible defaults and clear documentation
Over-Prioritization: Ensure low-priority issues aren't completely ignored

Integration Points

Drift Detection Integration (ADR-009)

Priority scoring integrates seamlessly with existing drift detection
Enhances drift detection results with actionable prioritization
Uses drift detection data for factor calculation

Repository Analysis Integration (ADR-002)

Leverages repository analysis for code complexity assessment
Uses analysis data for usage frequency calculation
Integrates with code structure analysis

Knowledge Graph Integration

Stores priority scores in knowledge graph for historical tracking
Enables trend analysis of priority patterns over time
Supports learning from prioritization outcomes

Implementation Roadmap

Phase 1: Core Scoring System (Completed)

Multi-factor scoring algorithm implementation
Default weight configuration
Basic recommendation generation
Integration with drift detection

Phase 2: Usage Metadata Collection (In Progress)

Function call tracking
Import analysis
Class instantiation tracking
Metadata persistence

Phase 3: Advanced Features (Planned)

Machine learning integration for weight optimization
Historical trend analysis
Project-specific weight recommendations
User feedback integration

Quality Assurance

Scoring Accuracy Validation

describe("PriorityScorer", () => {
  it("should calculate accurate priority scores");
  it("should handle edge cases (no usage data, missing metadata)");
  it("should respect weight configuration");
  it("should generate appropriate recommendations");
  it("should provide actionable suggestions");
});

Performance Testing

Usage metadata collection performance benchmarks
Scoring calculation performance under load
Memory usage with large codebases

User Feedback Integration

Collect developer feedback on prioritization accuracy
Iterate on scoring algorithm based on real-world usage
Validate recommendation effectiveness

Success Metrics

Prioritization Effectiveness

Critical Issue Detection: 95%+ of critical issues correctly identified
Developer Satisfaction: 80%+ developers find prioritization helpful
Documentation Coverage: Improved coverage of high-priority areas
Response Time: Faster response to critical drift issues

System Performance

Scoring Performance: Under 100ms per drift issue scored
Metadata Collection: Under 5% overhead on analysis performance
Accuracy: Priority scores correlate with developer-assigned priorities

Future Enhancements

Advanced Prioritization

Machine learning-based weight optimization
Project-specific learning and adaptation
User behavior analysis for usage frequency
Integration with issue tracking systems

Enhanced Metadata

Runtime usage tracking
User feedback collection
Documentation view analytics
Search query analysis

References

ADR-002: Multi-Layered Repository Analysis Engine Design
ADR-009: Content Accuracy and Validation Framework
Commit: 40afe64 - feat: Add priority scoring system for documentation drift (#83)
GitHub Issue: #83 - Priority scoring system for documentation drift

Status​

Date​

Context​

Decision​

Priority Scoring Architecture:​

1. Multi-Factor Scoring Model​

2. Configurable Weight System​

3. Actionable Recommendations​

Implementation Details:​

Priority Scoring Interface​

Scoring Algorithm​

Alternatives Considered​

Simple Severity-Based Prioritization​

Manual Prioritization Only​

Single-Factor Scoring​

ML-Based Prioritization​

Consequences​

Positive​

Negative​

Risks and Mitigations​

Integration Points​

Drift Detection Integration (ADR-009)​

Repository Analysis Integration (ADR-002)​

Knowledge Graph Integration​

Implementation Roadmap​

Phase 1: Core Scoring System (Completed)​

Phase 2: Usage Metadata Collection (In Progress)​

Phase 3: Advanced Features (Planned)​

Quality Assurance​

Scoring Accuracy Validation​

Performance Testing​

User Feedback Integration​

Success Metrics​

Prioritization Effectiveness​

System Performance​

Future Enhancements​

Advanced Prioritization​

Enhanced Metadata​

References​