
repo-intel
io.github.nirholas/repo-intel
Analyze repos of any size - security scanning code analysis monorepo support
Documentation
Lyra Intel
Complete Intelligence Infrastructure Engine for Massive-Scale Codebase Analysis
π Full Documentation | Quick Start | Use Cases | API Reference
Analyze codebases 10-100x faster with AI-powered insights, security scanning, and semantic search.
β‘ Active Development
Lyra Intel is actively being enhanced with improvements daily. The core platform is production-ready and being used in enterprise deployments. Thank you for your contributions! π
Why Lyra Intel?
Most code analysis tools force a choice: automation at the cost of understanding, or manual inspection with no scale.
Lyra Intel is built on a different principle: Give developers and security teams the intelligence they need to make informed decisions at scale.
You get:
- β Complete visibility - Understand your entire codebase, not just highlighted issues
- β AI-powered insights - Get context and explanations, not just lists of problems
- β Security you control - Run locally or in your cloud, with no data leaving your infrastructure
- β Scale without sacrifice - Analyze 1 million lines or 1 billion lines with the same ease
- β Open source - Full transparency, no vendor lock-in, customize for your needs
Perfect for teams that want to own their code intelligence.
What is Lyra Intel?
Lyra Intel is a comprehensive, production-ready intelligence platform designed to understand, secure, and improve codebases of any size - from small projects to enterprise monorepos with millions of lines of code.
Unlike traditional linters or SonarQube-style tools, Lyra Intel combines:
- Deep code analysis (AST parsing, dependency graphs, complexity metrics)
- AI-powered insights (OpenAI, Anthropic, or local models)
- Semantic code search (ML-powered search beyond keywords)
- Security scanning (secrets, OWASP, CVE detection)
- Knowledge graphs (understand relationships in your code)
- Forensic analysis (find dead code, document gaps, technical debt)
Why You Need Lyra Intel
For Security Teams:
- Automatically find hardcoded secrets, SQL injection risks, OWASP vulnerabilities
- Track security across massive codebases without manual scanning
- Generate compliance reports (SOC2, HIPAA, PCI-DSS ready)
For Development Teams:
- Understand unfamiliar codebases in hours, not weeks
- Find dead code and technical debt before they become problems
- Make data-driven architectural decisions
- Detect complex bugs that static analysis misses
For Engineering Leaders:
- Quantify code quality and technical debt
- Track metrics across teams and projects
- Plan migrations and upgrades with confidence
- Reduce time spent on code reviews
What You Can Do
With 70+ specialized components, Lyra Intel enables:
| Goal | What Lyra Intel Does | Time Saved |
|---|---|---|
| Secure a legacy codebase | Scan for vulnerabilities, create remediation plan | Weeks β Hours |
| Onboard new developers | Build searchable knowledge base, find examples | Days β Hours |
| Plan a framework upgrade | Analyze impact, generate step-by-step migration plan | Months β Days |
| Understand technical debt | Quantify debt, track trends, prioritize fixes | Ongoing β Automated |
| Review pull requests | AI-powered insights + security checks + complexity analysis | 30 min β 5 min |
| Find security issues | Scan for 50+ vulnerability patterns in real-time | Manual β Automated |
π Features
Lyra Intel includes 70+ specialized components organized by capability:
View All Features (70+ Components)
Core Analysis - Understand Your Code
- π File Crawler - Parallel directory traversal with streaming for memory efficiency. Process millions of files without memory issues.
- π Git Collector - Complete commit history, blame analysis, contributor stats. Understand who changed what and when.
- π AST Analyzer - Multi-language syntax tree parsing (Python, JS/TS, Go, Rust, Java, C++, C#, Ruby, PHP). Get accurate code structure.
- π Dependency Mapper - Build complete dependency graphs with circular detection. Understand your architecture.
- β οΈ Pattern Detector - Find code smells, anti-patterns, security issues. Detect problems before they become expensive.
Scalability - From Laptop to Enterprise
- π₯οΈ Local Mode - Single machine analysis for development. No setup needed, runs instantly on your machine.
- π Distributed Mode - Multi-worker processing for larger codebases. Scale analysis to 100K+ files efficiently.
- βοΈ Cloud Massive Mode - Auto-scaling cloud infrastructure (AWS, GCP, Azure). Analyze monorepos with millions of files.
Storage Options - Flexibility for Any Scale
- SQLite - Local development and small projects. Built-in, no dependencies.
- PostgreSQL - Production deployments. Reliable, proven, scalable.
- BigQuery - Massive-scale analytics. Query 1M+ analysis results instantly.
- Cache Layer - Memory, File, Redis backends with TTL/LRU eviction. Speed up repeated analyses.
π Security - Find Vulnerabilities Before They Become Breaches
- Security Scanner - OWASP Top 10, hardcoded secrets, SQL injection detection. Scan 50+ vulnerability patterns.
- Vulnerability Database - Track known CVEs and advisories. Stay updated on emerging threats.
- Custom Rules - Define custom security patterns. Enforce your organization's security standards.
π€ AI Integration - Get Smarter Insights
- AI Analyzer - Code explanation, bug detection, refactoring suggestions. Understand complex code instantly.
- Multiple Providers - OpenAI (GPT-4/3.5), Anthropic (Claude), or Local (Ollama/llama.cpp). Choose what fits your workflow.
- Cost Effective - Local models for free analysis, or cloud models for maximum accuracy.
π Visualization & Reports - Communicate Results
- Graph Generator - Export to D3.js, Mermaid, Graphviz DOT. Visualize dependencies and architecture.
- Report Generator - Executive, Technical, Security, Architecture reports. Different reports for different audiences.
- Web Dashboard - Interactive D3.js/Cytoscape visualization. Explore your codebase visually.
π API & Enterprise Features
- REST API Server - 15+ endpoints for integration. Build on top of Lyra Intel.
- Authentication - API Key, JWT, OAuth 2.0 (SSO), LDAP support. Secure access control.
- RBAC - Role-based access control. Manage permissions across your team.
- Rate Limiting - Protect your infrastructure. Scale safely.
π¬ Forensic Analysis - Find Hidden Problems
- Forensic Analyzer - Codeβdoc bidirectional mapping. Find documentation gaps automatically.
- Dead Code Detector - Find unused functions, classes, imports. Clean up your codebase.
- Complexity Analyzer - Cyclomatic, Cognitive, Halstead metrics. Identify problematic code.
π More Capabilities
- Code Generation - AI-powered function/class/API generation with custom templates
- Diff & Impact Analysis - Understand what changed and why it matters
- Migration Planning - Plan framework/version upgrades with step-by-step guidance
- Code Profiling - Detect N+1 queries, blocking I/O, inefficient algorithms
- Schema Analysis - Database schema analysis from ORM models
- Documentation Generator - Auto-generate API docs and changelogs
- Workflow Engine - Define and execute multi-step analysis pipelines
π Auto-Discovery Pipeline (NEW)
- GitHub Scanner - Automatically discover new MCP crypto tools from GitHub
- AI Tool Analyzer - Extract tool definitions using AI/pattern matching
- Security Scanner - Scan discovered tools for vulnerabilities
- Registry Submitter - Submit approved tools to the Lyra Registry
- Daily Automation - GitHub Actions workflow for continuous discovery
π Complete Documentation
Lyra Intel includes comprehensive documentation covering every aspect of the platform:
Core Documentation
-
π FEATURES.md - Detailed feature documentation with code examples for:
- Semantic Search (ML-powered code search)
- SSO Integration (OAuth 2.0, SAML 2.0, LDAP)
- Language Parsers (C++, C#, Ruby, PHP)
- Plugin System
- IDE Extensions (VS Code, JetBrains)
- CI/CD Integrations (GitLab, Bitbucket, GitHub Actions)
- Export Formats (PDF, SARIF, Excel, CSV)
- WebSocket Streaming
- Interactive CLI
- Web Dashboard
- Monitoring & Metrics (Prometheus, Grafana)
-
π» EXAMPLES.md - Working code examples for:
- Quick start (60-second analysis)
- Core analysis workflows
- Semantic search usage
- SSO setup and configuration
- Language-specific parsing
- Custom plugin development
- IDE extension installation
- CI/CD pipeline integration
- Real-time WebSocket streaming
- Monitoring setup
- Complete end-to-end workflows
-
ποΈ ARCHITECTURE.md - Technical architecture documentation:
- System overview and design
- Core component architecture
- Data flow diagrams
- Module organization
- Extension points
- Deployment architectures (single server, Kubernetes, AWS)
- Performance & scalability
- Security architecture
- Technology stack
-
π API.md - Complete REST API reference
-
π DEPLOYMENT.md - Deployment guides (Docker, Kubernetes, AWS)
-
π openapi.yaml - OpenAPI 3.0 specification
Real-World Workflows
- πΌ USE_CASES.md - Practical workflows and best practices:
- Securing legacy codebases
- Pre-commit code quality gates
- CI/CD security pipelines
- Code review assistance
- Monorepo migration planning
- Technical debt tracking
- Building team knowledge bases
- Integration patterns
- Performance optimization tips
Getting Started Guides
- β‘ QUICKSTART.md - Get up and running in 5 minutes
- π§ INSTALL.md - Installation instructions
- π TUTORIAL.md - Step-by-step tutorials for common use cases:
- First analysis
- Security audit
- Semantic search setup
- CI/CD integration
- Custom plugin development
- Production deployment
- Real-time dashboard
- β FAQ.md - Frequently asked questions
- π€ CONTRIBUTING.md - Contribution guidelines
Quick Start (5 Minutes)
Get up and running in just a few commands. No complex setup needed.
# 1. Clone the repository
git clone https://github.com/nirholas/lyra-intel.git
cd lyra-intel
# 2. Install (requires Python 3.9+)
pip install -e .
# 3. Quick scan - see what Lyra Intel finds in 30 seconds
python cli.py scan /path/to/any/code
# 4. Full analysis - comprehensive report
python cli.py analyze /path/to/code --output ./results.json
# 5. View results
cat results.json | jq . # Pretty print the JSON
# 6. (Optional) Start the web dashboard
python launch_dashboard.py
# Then visit http://localhost:8080
What to Expect
After running scan, you'll see:
β
Analyzing repository...
π Files analyzed: 156
π Total functions: 1,247
β οΈ Issues found: 43
π Security findings: 5
Running analyze produces detailed JSON with:
- Metrics: Line counts, complexity, test coverage
- Security: Vulnerabilities, secrets detection
- Dependencies: Import relationships, circular deps
- Patterns: Code smells, anti-patterns
- Git history: Commit stats, contributors
πΌ Common Use Cases
Real teams use Lyra Intel for:
π Security Teams
"I need to scan our 500K LOC codebase for vulnerabilities"
- Secure a Legacy Codebase - Full audit in 30 min
- Automatic CI/CD security gates
- Pre-commit hooks that block insecure code
- Regular scheduled security scans
π¨βπ» Development Teams
"New developer is joining - how do we onboard them on 200K lines of code?"
- Build a Team Knowledge Base - Semantic search over your codebase
- Find similar code patterns
- Understand architecture through visualization
- Track technical debt
ποΈ Platform Teams
"We need to upgrade from Node 14 to Node 18 - is it safe?"
- Plan a Monorepo Migration - Step-by-step migration plan
- Analyze impact across all packages
- Identify breaking changes
- Estimate effort per package
π Engineering Leads
"Is our code quality improving or getting worse?"
- Track Technical Debt - Monthly trend tracking
- Visualize metrics over time
- Prioritize what to fix first
- Show data-driven reports to management
π Code Review
"Reviews are taking too long - 30 min per PR"
π€ MCP Integration (Claude & LLMs)
Use Lyra Intel directly from Claude, Claude Code, or any MCP-compatible LLM.
Quick Setup
# Claude Code - one command
npx lyra-intel-mcp
# Claude Desktop - add to config
{
"mcpServers": {
"lyra-intel": {
"command": "npx",
"args": ["-y", "lyra-intel-mcp"]
}
}
}
Available MCP Tools
| Tool | Description |
|---|---|
analyze-codebase | Comprehensive code analysis with AST, dependencies, metrics |
search-code | ML-powered semantic code search |
get-complexity | Cyclomatic, cognitive, and Halstead complexity |
get-security-issues | Security vulnerabilities, secrets, compliance |
discovery-scan-github | Find new MCP crypto tools on GitHub |
discovery-analyze-repo | Extract MCP tool definitions from repos |
discovery-run-pipeline | Full discovery + analysis + submission |
Example Prompts
"Analyze my project at ~/code/myapp for security issues"
"Search for authentication patterns in the codebase"
"Scan GitHub for new MCP crypto tools from the last 7 days"
"Run the discovery pipeline and submit approved tools"
See full MCP documentation β
ποΈ Architecture
- AI-Powered Code Review - Automated insights in 30 seconds
- Security analysis
- Complexity warnings
- AI suggestions for improvements
π See 7 complete workflows with code examples β
Architecture
lyra-intel/
βββ src/
β βββ core/ # Main engine orchestration
β βββ collectors/ # Data collection (files, git)
β βββ analyzers/ # Code analysis (AST, dependencies, patterns)
β βββ storage/ # Database and persistence
β βββ agents/ # Multi-agent system
β βββ search/ # Code and semantic search
β βββ query/ # Natural language queries
β βββ visualizers/ # Graph generation
β βββ reports/ # Report generation
β βββ web/ # Web dashboard
β βββ api/ # REST API server
β βββ auth/ # Authentication and authorization
β βββ plugins/ # Plugin system
β βββ ai/ # AI integration
β βββ metrics/ # Metrics collection
β βββ events/ # Event system
β βββ notifications/ # Notifications and alerts
β βββ forensics/ # Forensic analysis
β βββ cache/ # Caching layer
β βββ pipeline/ # Streaming pipeline
β βββ testing/ # Testing infrastructure
β βββ knowledge/ # Knowledge graph system
β βββ diff/ # Diff and impact analysis
β βββ generation/ # Code generation
β βββ security/ # Security scanning
β βββ migration/ # Migration planning
β βββ profiler/ # Performance profiling
β βββ schema/ # Schema analysis
β βββ docgen/ # Documentation generation
β βββ integrations/ # External integrations
β βββ workflow/ # Workflow engine
βββ config/ # Configuration files
βββ scripts/ # Utility scripts
βββ Dockerfile # Container build
βββ docker-compose.yml # Multi-service deployment
βββ cli.py # Command-line interface
Processing Modes
Local Mode
Best for development and small repositories:
from src import LyraIntelEngine, EngineConfig, ProcessingMode
config = EngineConfig(mode=ProcessingMode.LOCAL, max_workers=8)
engine = LyraIntelEngine(config)
result = await engine.analyze_repository("/path/to/repo")
Distributed Mode
For larger codebases with multiple workers:
config = EngineConfig(
mode=ProcessingMode.DISTRIBUTED,
max_workers=50,
)
Cloud Massive Mode
For enterprise-scale analysis:
config = EngineConfig(
mode=ProcessingMode.CLOUD_MASSIVE,
cloud_provider="aws",
cloud_region="us-east-1",
max_cloud_workers=1000,
)
Analysis Results
The engine produces comprehensive analysis including:
- File metrics: Total files, sizes, line counts by extension
- Code structure: Functions, classes, methods with complexity scores
- Dependencies: Import/export relationships, circular dependencies
- Git history: Commits, authors, change frequency
- Patterns: Code smells, anti-patterns, security issues
Results are stored in SQLite (or your configured backend) and can be exported as JSON.
Cloud Support
Lyra Intel is designed to leverage cloud resources efficiently:
| Provider | Instance Types | Spot Support | Optimization |
|---|---|---|---|
| AWS | EC2, Lambda, ECS | β Supported | ~70% savings |
| GCP | Compute Engine, Cloud Run | β Supported | ~70% savings |
| Azure | VMs, Functions | β Supported | ~70% savings |
Auto-scaling and cost optimization features included.
How Lyra Intel Compares
| Feature | Lyra Intel | SonarQube | Snyk | GitHub Advanced Security |
|---|---|---|---|---|
| Open Source | β MIT | β Commercial | β Proprietary | β οΈ Limited |
| Semantic Code Search | β ML-powered | β No | β No | β No |
| AI Integration | β Any provider | β No | β No | β GitHub Copilot only |
| Monorepo Support | β Up to 1M files | β οΈ Limited | β Good | β Good |
| Self-Hosted | β Full | β οΈ Enterprise only | β οΈ Limited | β GitHub-hosted |
| Cost | β Free | π°π°π° | π°π° | π° |
| Knowledge Graph | β Automatic | β No | β No | β No |
| Forensic Analysis | β Dead code, debt | β οΈ Basic | β No | β οΈ Basic |
| Migration Planning | β Automated steps | β No | β No | β No |
| Multi-Language | β 10+ languages | β Many | β οΈ JS/Python focus | β Many |
| Real-time Dashboard | β React UI | β Yes | β Yes | β Yes |
Bottom line: Lyra Intel is best for teams that want deep code understanding + AI insights + full control, all open source.
π£οΈ Roadmap
β Phase 1: Core Platform (Complete)
- Complete analysis engine with 70+ components
- Multi-language parsing (10+ languages)
- Dependency graphing and pattern detection
- Git history analysis and forensics
- Security scanning (50+ patterns)
- AI integration (OpenAI, Anthropic, Ollama)
β Phase 2: Enterprise Features (Complete)
- REST API with 15+ endpoints
- Web dashboard with interactive visualizations
- Knowledge graph and semantic search
- RBAC, SSO, and authentication
- Code generation and migration planning
- IDE plugins (VS Code, JetBrains)
β Phase 3: Scale & Performance (Complete)
- Distributed analysis for 100K+ files
- Cloud massive mode (AWS/GCP/Azure auto-scaling)
- Real-time streaming analysis
- ML-based code review
- Performance profiling and optimization
- Schema analysis and workflow engine
π Phase 4: Advanced Features (In Progress)
- Enhanced ML models for code understanding
- Custom model fine-tuning
- Advanced compliance reporting
- Real-time dashboard improvements
- Performance benchmarking suite
π Future Phases
- Automated remediation suggestions
- Integration with more CI/CD platforms
- Mobile app for dashboard access
- Advanced visualization options
- Community plugin marketplace
π Metrics & Monitoring
Access metrics at:
- Prometheus:
http://localhost:9090 - Grafana:
http://localhost:3000 - API Health:
http://localhost:8080/api/v1/health
Key metrics:
lyra_intel_requests_total- Total API requestslyra_intel_analysis_duration_seconds- Analysis performancelyra_intel_ai_tokens_total- AI usage trackinglyra_intel_cache_hits_total- Cache efficiency
π€ Contributing
We welcome contributions! Please see CONTRIBUTING.md for guidelines.
π Troubleshooting
Common issues and solutions:
Database connection failed
docker-compose restart postgres
docker-compose logs postgres
High memory usage
# Reduce workers
export WORKERS=4
# Increase memory limit
docker-compose up -d --scale api=1 --memory 4g
API rate limit
# Increase rate limits in config
export RATE_LIMIT_PER_MINUTE=1000
See DEPLOYMENT.md for comprehensive troubleshooting.
π Project Status
- β Core analysis engine
- β Multi-language support (10+ languages)
- β AI integrations (OpenAI, Anthropic, Ollama)
- β Security scanning (OWASP, secrets, dependencies)
- β Export formats (JSON, HTML, PDF, SARIF, CSV, Excel)
- β IDE plugins (VS Code, JetBrains)
- β Platform integrations (GitHub, GitLab, Bitbucket)
- β Cloud deployment (AWS, Kubernetes, Docker)
- β Real-time streaming (WebSocket)
- β Web dashboard (React)
- β Monitoring (Prometheus, Grafana)
- β Enterprise features (SSO, RBAC, audit logs)
π Show Your Support
If you find Lyra Intel helpful, consider:
- β Star this repository - It helps others discover the project
- π Report issues - Help us improve by reporting bugs
- π‘ Share ideas - Suggest features and improvements
- π€ Contribute - See CONTRIBUTING.md for guidelines
- π’ Spread the word - Share with your team and community
Every star, contribution, and mention helps grow the community!
π Acknowledgments
Built with amazing open-source tools:
- OpenAI & Anthropic - AI models
- FastAPI - Web framework
- React - UI framework
- Prometheus - Monitoring
- PostgreSQL - Database
π§ Contact & Support
- Issues & Bug Reports: GitHub Issues
- Documentation: Full Documentation
- Contributing: See CONTRIBUTING.md
Made with β€οΈ for developers, security teams, and engineering leaders.
π License
MIT License - see LICENSE file for details.
Made withβ€οΈby nich | Follow me on X.com
No installation packages available.
Related Servers
ai.smithery/eliu243-oura-mcp-server-2
Connect your Oura Ring account to enable secure, authenticated access in your workflows. Generateβ¦
ai.smithery/eliu243-oura-mcp-server-eliu
Connect your Oura Ring account securely in minutes. Enable authorized access to your sleep, activiβ¦
com.adspirer/ads
Remote MCP for cross-platform ad creation (Google Ads, TikTok). OAuth 2.1 with progress streaming.