Powered by OpenClaw AI

STOP TRUSTING GHOST CODE

Drop a GitHub link. Our AI tears it apart — finds the lies, exposes the larp, and delivers a verdict in seconds.

Paste any GitHub repository URL

openclaw_agent v0.1 -- beta
Fake Repo Detection · Code Quality Analysis · Commit History Forensics · Dependency Audit · Larp vs. Reality · AI-Powered Verdict · OpenClaw Engine

01 -- Process

HOW THE
CLAW WORKS

The GitHub ecosystem is flooded with performative repos — all README, no code. ClawGit cuts through the noise with multi-layer AI analysis.

01

Drop the Link

Paste any GitHub repository URL. Public repos today; private repo support coming soon.

02

OpenClaw Digs In

Our agent scans commits, code structure, dependencies, tests, and contribution patterns.

03

Verdict Delivered

Legit or Fake Repo — with detailed breakdown, confidence score, and specific red flags.

github.com/cool-ai-startup/super-model
SCANNING
0 commits scanned
0 files analyzed
0 deps checked

[X] No test files found anywhere in repo. Zero coverage.
[X] README references 98.7% accuracy — no benchmark code, no eval script.
[!] 87% of commits from single day. Suspicious burst pattern.
[!] Model weights referenced but not included or linked.
[+] License file present (MIT). Repo structure looks standard.

FAKE REPO
94% confidence

02 -- Capabilities

WHAT WE
TEAR APART

Deep Code Forensics

OpenClaw reads every file, not just README. We check if the claimed functionality actually exists in code — function definitions, imports, execution paths, and logical coherence.

Beta

Commit Pattern Analysis

Detect artificial commit inflation, single-day dumps, and copied histories.

Beta

Test Coverage Check

Legitimate software has tests. We scan for test files, coverage reports, CI configs. No tests = big red flag.

Live

Metric Verification

Claimed "99% accuracy"? We look for the eval code, datasets, and benchmarks that would produce that number.

Beta

Dependency Reality Check

Are claimed dependencies real, pinned, and actually used in code?

Coming Soon

AI Model Validation

For AI repos specifically — does the architecture code match the claimed paper? Are weights linked? Can it actually run? We cross-reference arXiv, HuggingFace, and run feasibility checks.

Coming Soon

Full Report Export Coming Soon

Shareable PDF reports with full verdict breakdown, evidence citations, and confidence scoring. Share with your team, investors, or post to call out the larp publicly.

DATA

03 -- Numbers

WHAT WE
FOUND SO FAR

0 Repos Analyzed (in closed beta)
0% Had Zero Tests (of "AI" repos scanned)
0% Fake Verdict Rate (beta sample)
0s Avg Scan Time (target -- in optimization)

Metrics from closed beta testing -- small sample size, subject to change

04 -- Interactive

ADJUST THE
TRUST THRESHOLD

Configure how strict ClawGit's verdict engine is. Higher threshold = more repos flagged.

Strictness Level: 50%
Lenient | Soft | Balanced | Strict | Paranoid
BALANCED MODE

05 -- Documentation

OPENCLAW AI
AGENT DOCS

What is ClawGit?

ClawGit is an AI-powered GitHub repository analysis platform designed to detect code fraud, performative programming, and misleading open-source projects. Powered by the OpenClaw engine, it performs deep multi-layer analysis of repository structure, code quality, commit patterns, and claimed metrics.

The Problem

The open-source ecosystem is increasingly polluted with repositories that look impressive on the surface but contain little to no real functionality. These "larp repos" feature polished READMEs, inflated star counts, and bold claims — but the actual code tells a different story. Developers waste time evaluating these projects, investors get misled, and the community suffers from noise drowning out genuine work.

The Solution

ClawGit uses AI-driven forensic analysis to cut through the noise. By examining every file, every commit, and every dependency, OpenClaw builds a comprehensive picture of a repository's legitimacy and delivers a confidence-weighted verdict backed by specific evidence.

Key Capabilities

  • Deep code forensics across every source file in the repository
  • Commit history pattern analysis and anomaly detection
  • Test infrastructure scanning and coverage estimation
  • Claimed metric verification against actual evaluation code
  • Dependency manifest validation and usage tracking
  • AI/ML model validation including architecture and weight checks
  • Confidence-weighted verdicts with detailed evidence reports

Current Status

ClawGit is currently in closed beta. The core scanning engine is operational with code forensics, commit analysis, and test coverage checking live. Dependency auditing and AI model validation are in active development. Full API access and PDF report exports are planned for the public launch.

System Architecture

OpenClaw operates as a pipeline of specialized analysis modules, each designed to examine a specific dimension of repository legitimacy. The modules run in parallel where possible and feed their results into a central verdict engine.

Pipeline Overview

// OpenClaw Analysis Pipeline

Repository URL
  |-- Repository Fetcher     -- Clone + file tree mapping
  |-- Code Forensics Engine  -- Static analysis per file
  |-- Commit Analyzer        -- Git history pattern detection
  |-- Dependency Auditor     -- Package manifest validation
  |-- Test Scanner           -- Test infrastructure detection
  |-- Metric Verifier        -- README claims vs code reality
  |-- AI Model Validator     -- ML-specific checks
  |
  v
Verdict Engine -- Confidence-weighted final assessment
  |
  v
JSON Report + Verdict

Core Components

  • Repository Fetcher — Handles cloning, caching, and file tree mapping. Supports public GitHub repositories with private repo support planned. Generates a complete file manifest with metadata (size, type, last modified).
  • Code Forensics Engine — Performs static analysis on every source file. Checks for function definitions matching claimed functionality, import/usage consistency, execution path coherence, code complexity metrics, and copy-paste detection.
  • Commit Analyzer — Examines the full git history for anomalous patterns. Detects single-day dumps, artificial inflation, bot-like timing, inconsistent authorship, and force-push evidence.
  • Dependency Auditor — Validates package manifests (package.json, requirements.txt, Cargo.toml, etc.) against actual import statements. Detects phantom dependencies, missing dependencies, and version inconsistencies.
  • Test Scanner — Searches for test files, test directories, CI/CD configuration, coverage reports, and test-to-code ratios. Distinguishes between meaningful tests and placeholder stubs.
  • Metric Verifier — Extracts numerical claims from README and documentation, then searches the codebase for evaluation scripts, benchmark datasets, and measurement code that would produce those numbers.
  • AI Model Validator — Specialized module for ML repositories. Checks architecture definitions, model weight availability, training script functionality, and cross-references claims with arXiv, HuggingFace, and PapersWithCode.
  • Verdict Engine — Aggregates signals from all modules using a weighted scoring algorithm. Produces a final verdict (LEGIT or FAKE REPO) with a confidence percentage and detailed evidence breakdown.

Technology Stack

  • AI backbone: Claude (Anthropic) for natural language analysis and reasoning
  • Static analysis: Custom AST parsers for Python, JavaScript, TypeScript, Rust, Go
  • Git analysis: libgit2 bindings for efficient history traversal
  • Infrastructure: Sandboxed execution environments for safe code analysis
  • API: RESTful JSON API with WebSocket support for real-time scan progress

Analysis Methodology

OpenClaw employs a multi-dimensional analysis approach. Each dimension contributes weighted signals to the final verdict, ensuring no single factor produces a false positive or negative.

Code Forensics

Every source file in the repository is analyzed for:

  • Function/class definitions that match README-claimed functionality
  • Import statements that correspond to actual usage in code
  • Execution paths that are logically coherent and reachable
  • Code complexity appropriate for the claimed project scope
  • Dead code ratio — high ratios suggest generated or copy-pasted code
  • Naming conventions and style consistency across the codebase
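
One concrete version of the first check, matching README-claimed functionality against actual definitions, can be sketched with Python's `ast` module. The backtick-call convention and the helper names are illustrative assumptions, not OpenClaw's actual parser.

```python
import ast
import re

def claimed_functions(readme: str) -> set:
    """Extract backticked identifiers written as calls, e.g. `train()`."""
    return set(re.findall(r"`(\w+)\(\)`", readme))

def defined_functions(source: str) -> set:
    """Collect every function name actually defined in a source file."""
    tree = ast.parse(source)
    return {n.name for n in ast.walk(tree) if isinstance(n, ast.FunctionDef)}

readme = "Run `train()` then `evaluate()` for state-of-the-art results."
source = "def train():\n    pass\n"
ghost = claimed_functions(readme) - defined_functions(source)
# ghost -> {'evaluate'}: claimed in the README but absent from the code
```

A non-empty `ghost` set is exactly the kind of evidence behind a GHOST_CODE flag.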

Commit Pattern Analysis

The git history reveals crucial information about how a project was actually developed. Suspicious patterns include:

  • Single-day dumps where the entire project appears in one or two commits
  • Artificial inflation through meaningless whitespace or formatting changes
  • Bot-like commit timing with unnaturally regular intervals
  • Inconsistent author information suggesting copied histories
  • Evidence of force-pushes that may hide the actual development timeline
  • Commit message patterns that suggest automated or bulk generation
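
The single-day-dump signal above reduces to a simple statistic over commit timestamps. A minimal sketch, assuming timestamps have already been extracted from the git history (the 0.8 threshold is an illustrative assumption):

```python
from collections import Counter
from datetime import datetime

def burst_ratio(commit_times):
    """Fraction of commits that land on the single busiest day."""
    days = Counter(t.date() for t in commit_times)
    return max(days.values()) / len(commit_times)

# 20 commits on one day, 1 on another: a classic dump pattern
commits = [datetime(2025, 1, 5, h) for h in range(20)] + [datetime(2025, 2, 1, 9)]
ratio = burst_ratio(commits)
suspicious = ratio > 0.8  # COMMIT_DUMP-style burst
```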

Test Coverage Analysis

Legitimate software projects have testing infrastructure. We scan for:

  • Test files and test directories (tests/, __tests__/, spec/, etc.)
  • CI/CD configuration (.github/workflows, .gitlab-ci.yml, Jenkinsfile)
  • Coverage report artifacts and configuration
  • Test-to-code ratio analysis
  • Quality assessment: meaningful tests vs. placeholder stubs
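
The file-level part of this scan is straightforward to sketch with `pathlib`. This is a simplified illustration of the detection step only (it finds Python-style test files and the CI markers listed above; it does not assess test quality):

```python
import tempfile
from pathlib import Path

TEST_DIR_NAMES = {"tests", "__tests__", "spec"}
CI_MARKERS = {".gitlab-ci.yml", "Jenkinsfile"}

def has_test_infrastructure(root: Path) -> bool:
    """True if the repo contains test directories, test_*.py files,
    CI config files, or GitHub Actions workflows."""
    if (root / ".github" / "workflows").is_dir():
        return True
    for p in root.rglob("*"):
        if p.is_dir() and p.name in TEST_DIR_NAMES:
            return True
        if p.name in CI_MARKERS:
            return True
        if p.name.startswith("test_") and p.suffix == ".py":
            return True
    return False

# Demo on a throwaway directory standing in for a cloned repo
repo = Path(tempfile.mkdtemp())
before = has_test_infrastructure(repo)   # False: bare repo, no tests
(repo / "tests").mkdir()
after = has_test_infrastructure(repo)    # True: tests/ directory found
```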

Metric Verification

When repositories claim specific performance metrics (accuracy, speed, benchmarks), we verify:

  • Presence of evaluation scripts that could produce the claimed numbers
  • Benchmark datasets referenced and accessible
  • Reproducibility of claimed results given the code structure
  • Cross-referencing with published papers when cited
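
The first step, pulling numeric claims out of README prose, can be sketched as a regex pass. The pattern below is an illustrative assumption covering a handful of common metric names, not the full extractor:

```python
import re

CLAIM_RE = re.compile(r"(\d{1,3}(?:\.\d+)?)\s*%\s*(accuracy|precision|recall|F1)", re.I)

def extract_metric_claims(readme: str):
    """Pull numeric performance claims out of README prose."""
    return [(float(value), metric.lower()) for value, metric in CLAIM_RE.findall(readme)]

claims = extract_metric_claims("Our model achieves 98.7% accuracy and 95% recall.")
# claims -> [(98.7, 'accuracy'), (95.0, 'recall')]
```

Each extracted claim then becomes a search target: if no evaluation script or benchmark in the repo could plausibly produce that number, the claim earns a PHANTOM_METRICS flag.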

Dependency Audit

We verify that:

  • All declared dependencies are real, published packages
  • Declared dependencies are actually imported in source code
  • No critical dependencies are missing from the manifest
  • Version pinning follows security best practices
  • No known vulnerabilities in dependency tree
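
The declared-vs-imported comparison can be sketched for a requirements.txt-style manifest. This is a simplification: it naively assumes the package name matches the import name (which fails for cases like Pillow/PIL), and the helper names are illustrative.

```python
import re

def declared_deps(requirements: str) -> set:
    """Parse package names out of a requirements.txt-style manifest."""
    names = set()
    for line in requirements.splitlines():
        line = line.split("#")[0].strip()   # drop comments
        if line:
            names.add(re.split(r"[=<>!~\[]", line)[0].strip().lower())
    return names

def imported_modules(source: str) -> set:
    """Top-level module names imported in a source file."""
    return {m.split(".")[0].lower()
            for m in re.findall(r"^\s*(?:from|import)\s+([\w.]+)", source, re.M)}

reqs = "torch==2.1\nnumpy>=1.24\nrich  # never used\n"
code = "import torch\nfrom numpy import array\n"
phantom = declared_deps(reqs) - imported_modules(code)
# phantom -> {'rich'}: declared but never imported (DEAD_DEPS evidence)
```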

Scoring System

The verdict engine uses a weighted multi-signal scoring algorithm to produce two primary metrics: the Legitimacy Score and the Confidence Level.

Legitimacy Score (0-100)

Range    Classification        Description
90-100   Verified Legitimate   Well-structured, tested, documented, active development
70-89    Likely Legitimate     Minor concerns but generally solid codebase
50-69    Questionable          Significant gaps or concerns requiring manual review
30-49    Suspicious            Multiple red flags detected across analysis dimensions
0-29     Likely Fake           Overwhelming evidence of fraud or performative code

Confidence Level

Level      Threshold   Meaning
Very High  >90%        Multiple strong signals align across all dimensions
High       70-90%      Clear patterns with minor ambiguity
Medium     50-70%      Mixed signals requiring interpretation
Low        <50%        Insufficient data for confident assessment
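
The two bandings above translate directly into lookup functions. A minimal sketch (function names are illustrative; boundary handling at the exact thresholds follows the table ranges):

```python
def classify_legitimacy(score: int) -> str:
    """Map a 0-100 legitimacy score to its classification band."""
    if score >= 90:
        return "Verified Legitimate"
    if score >= 70:
        return "Likely Legitimate"
    if score >= 50:
        return "Questionable"
    if score >= 30:
        return "Suspicious"
    return "Likely Fake"

def confidence_level(pct: float) -> str:
    """Map a confidence percentage to its level."""
    if pct > 90:
        return "Very High"
    if pct >= 70:
        return "High"
    if pct >= 50:
        return "Medium"
    return "Low"
```

For example, the demo scan's legitimacy score of 18 with 94% confidence would classify as "Likely Fake" with "Very High" confidence.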

Signal Weights

Each analysis module contributes a weighted signal to the final score. Default weights (configurable via strictness parameter):

code_forensics:      0.30  // Primary signal
commit_analysis:     0.20  // History patterns
test_coverage:       0.20  // Test infrastructure
metric_verification: 0.15  // Claims vs reality
dependency_audit:    0.10  // Package validation
ai_model_check:      0.05  // ML-specific (if applicable)
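
In code, the aggregation amounts to a weighted sum. The sketch below uses the default weights above; renormalizing over the modules that actually ran (so an inapplicable `ai_model_check` does not drag the score) is an assumption about the engine's behavior, not a documented guarantee.

```python
WEIGHTS = {
    "code_forensics": 0.30,
    "commit_analysis": 0.20,
    "test_coverage": 0.20,
    "metric_verification": 0.15,
    "dependency_audit": 0.10,
    "ai_model_check": 0.05,
}

def legitimacy_score(signals: dict) -> float:
    """Weighted sum of per-module signals (each in [0, 1], higher = more
    legitimate), renormalized over modules that ran, scaled to 0-100."""
    active = {k: v for k, v in signals.items() if v is not None}
    total_weight = sum(WEIGHTS[k] for k in active)
    weighted = sum(WEIGHTS[k] * v for k, v in active.items())
    return round(100 * weighted / total_weight, 1)

score = legitimacy_score({
    "code_forensics": 0.2,
    "commit_analysis": 0.1,
    "test_coverage": 0.0,
    "metric_verification": 0.1,
    "dependency_audit": None,   # module skipped
    "ai_model_check": None,     # not an ML repo
})
# score -> 11.2, well inside the "Likely Fake" band
```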

API Reference

The ClawGit API provides programmatic access to the scanning engine. Currently in closed beta with limited access.

Scan Repository

POST /api/v1/scan

// Request Body
{
  "repo_url": "https://github.com/user/repo",
  "strictness": 50,
  "options": {
    "deep_scan": true,
    "include_ai_validation": true,
    "export_format": "json"
  }
}

// Response
{
  "verdict": "FAKE REPO",
  "confidence": 94,
  "legitimacy_score": 18,
  "summary": "All README, no substance",
  "flags": [
    { "code": "ZERO_TESTS", "severity": "critical" },
    { "code": "COMMIT_DUMP", "severity": "high" }
  ],
  "green_flags": [],
  "scan_time_ms": 11847,
  "engine_version": "0.1.0-beta"
}
```
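
From a client's side, building that request body is mechanical. A minimal sketch; the base URL and bearer-token auth in the trailing comment are assumptions (the docs above do not specify either), and `build_scan_request` is an illustrative helper name.

```python
import json

def build_scan_request(repo_url: str, strictness: int = 50) -> dict:
    """Assemble a request body for POST /api/v1/scan."""
    return {
        "repo_url": repo_url,
        "strictness": strictness,
        "options": {
            "deep_scan": True,
            "include_ai_validation": True,
            "export_format": "json",
        },
    }

body = build_scan_request("https://github.com/user/repo")
payload = json.dumps(body)
# Send with any HTTP client, e.g. (base URL and auth scheme assumed):
#   requests.post("https://<api-host>/api/v1/scan", data=payload,
#                 headers={"Authorization": f"Bearer {API_KEY}",
#                          "Content-Type": "application/json"})
```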

Get Scan Status

GET /api/v1/scan/{scan_id}/status

// Response
{
  "status": "in_progress",
  "progress": 67,
  "current_module": "commit_analysis",
  "elapsed_ms": 7200
}

Export Report

GET /api/v1/scan/{scan_id}/report?format=pdf

// Returns downloadable PDF report
// Status: Coming Soon

Rate Limits

Tier        Limit           Status
Beta        10 scans/hour   Active
Pro         100 scans/hour  Planned
Enterprise  Unlimited       Planned

Flag Taxonomy

OpenClaw uses a standardized taxonomy of red and green flags to categorize findings. Each flag is backed by specific evidence from the analysis.

Red Flags

Code             Severity   Description
GHOST_CODE       Critical   README describes functionality that doesn't exist in code
COMMIT_DUMP      High       Entire project committed in a single day
ZERO_TESTS       High       No test files, test configs, or coverage reports
PHANTOM_METRICS  High       Claimed performance with no evaluation code
DEAD_DEPS        Medium     Dependencies declared but never imported
STAR_INFLATION   Medium     Suspicious star/fork growth patterns
BOILERPLATE      Medium     Mostly template or generated code
MISSING_WEIGHTS  High       ML repo with no accessible model weights
PAPER_MISMATCH   Critical   Code architecture doesn't match claimed paper
COPY_PASTE       Medium     Large portions copied from other repositories

Green Flags

Code          Signal    Description
TESTED        Strong    Comprehensive test suite with meaningful coverage
CI_CD         Strong    Active continuous integration pipeline
COMMUNITY     Moderate  Multiple contributors, issues, pull requests
DOCUMENTED    Moderate  Inline docs, API docs, usage examples
VERSIONED     Moderate  Proper semantic versioning and changelogs
REPRODUCIBLE  Strong    Clear setup instructions that actually work
BENCHMARKED   Strong    Verifiable performance benchmarks included

Integration Guide

ClawGit can be integrated into your development workflow through multiple channels. All integrations use the same underlying OpenClaw engine.

GitHub Actions

# .github/workflows/clawgit.yml
name: ClawGit Audit
on: [pull_request]
jobs:
  scan:
    runs-on: ubuntu-latest
    steps:
      - uses: clawgit/scan-action@v1
        with:
          strictness: 60
          fail-on-verdict: "FAKE REPO"
          api-key: ${{ secrets.CLAWGIT_API_KEY }}

CLI Tool (Planned)

# Install
$ npm install -g @clawgit/cli

# Scan a repository
$ clawgit scan https://github.com/user/repo

# Scan with custom strictness
$ clawgit scan https://github.com/user/repo --strictness 80

# Export report
$ clawgit scan https://github.com/user/repo --export pdf

Webhook Notifications

// Configure webhook endpoint
POST /api/v1/webhooks
{
  "url": "https://your-server.com/clawgit-hook",
  "events": ["scan.complete", "scan.failed"],
  "secret": "your-webhook-secret"
}
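
The docs mention a webhook secret but do not specify the signing scheme, so the sketch below assumes the common HMAC-SHA256 pattern (payload signed with the shared secret, hex digest delivered in a header). Both the scheme and any header name are assumptions until confirmed; the constant-time comparison via `hmac.compare_digest` is the important part regardless.

```python
import hashlib
import hmac

def verify_signature(secret: str, payload: bytes, signature: str) -> bool:
    """Constant-time check of an HMAC-SHA256 payload signature."""
    expected = hmac.new(secret.encode(), payload, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signature)

secret = "your-webhook-secret"
payload = b'{"event": "scan.complete", "verdict": "FAKE REPO"}'

# The sender would compute this and attach it to the delivery (assumed scheme)
sig = hmac.new(secret.encode(), payload, hashlib.sha256).hexdigest()

ok = verify_signature(secret, payload, sig)        # True: untampered payload
bad = verify_signature(secret, payload, "deadbeef")  # False: forged signature
```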

Privacy and Security

  • Repository data is not stored after analysis completes
  • All scan results are encrypted in transit (TLS 1.3)
  • No source code is copied, retained, or used for training
  • Analysis runs in isolated, sandboxed environments
  • API keys are scoped and revocable
  • SOC 2 Type II compliance planned for enterprise launch

Ready to Cut the Larp?

JOIN THE WAITLIST

ClawGit is in closed beta. Be first to access the full OpenClaw engine when we open the gates.

Currently in closed beta -- full launch coming soon