API Catalog
API Catalog
Separation of Duties: See Separation of Duties - API documentation is responsible for documenting APIs. It does NOT own agent manifests, execution, or infrastructure configuration.
Complete listing of all APIs across the Bluefly LLM Platform ecosystem.
Overview
Total APIs: 240+ services across 6 categories Architecture: OpenAPI-First, RESTful, gRPC where applicable Documentation: All services maintain OpenAPI 3.1 specifications Authentication: Unified JWT/OAuth 2.0 with GitLab integration
API Categories
1. Agent Services (12 APIs)
Core agent communication, coordination, and intelligence services.
| Service | Port | Domain | Description |
|---|---|---|---|
| Agent Router | 3001 | gateway.local.bluefly.io | LLM Gateway & API orchestration |
| Agent Protocol | 3050 | protocol.local.bluefly.io | MCP coordinator for LLM ecosystem |
| Agent Mesh | 3040 | mesh.local.bluefly.io | Multi-agent communication mesh with gRPC |
| Agent Brain | 3010 | agent-brain.local.bluefly.io | Cognitive processing & reasoning |
| Agent Tracer | 3100 | tracer.local.bluefly.io | Operations intelligence & tracing |
| Agent Chat | 3020 | chat.local.bluefly.io | Multi-agent chat with LibreChat |
| Agent Docker | 3030 | docker.local.bluefly.io | Container intelligence & management |
| Agent Studio | 3060 | studio.local.bluefly.io | AI agent development IDE |
| Agent Ops | 3002 | ops.local.bluefly.io | DevOps operations management |
OpenAPI Specs:
2. Workflow Engines (4 APIs)
Workflow orchestration, compliance, and automation engines.
| Service | Port | Domain | Description |
|---|---|---|---|
| Workflow Engine | 3090 | workflow.local.bluefly.io | Langflow-compatible visual workflows |
| Compliance Engine | 3080 | compliance.local.bluefly.io | FedRAMP, NIST, HIPAA, GDPR compliance |
| Doc Engine | 3018 | docs.local.bluefly.io | Documentation generation & management |
| Agentic Flows | 3070 | flows.local.bluefly.io | Flow orchestration & coordination |
OpenAPI Specs:
Key Features:
- Visual workflow builder (Langflow integration)
- Real-time compliance monitoring
- FedRAMP Moderate baseline support
- NIST AI RMF 1.0 implementation
- Automated audit logging
3. Local Models (26 FREE Models)
Zero-cost local inference via Ollama MCP integration.
| Provider | Endpoint | Cost | Models Available |
|---|---|---|---|
| Ollama MCP | http://localhost:11435 | $0/month | 26 models |
Featured Models:
- qwen2.5-coder:32b (19GB) - Best for code generation
- gpt-oss:120b (65GB) - General purpose reasoning
- deepseek-coder-v2:16b (9GB) - Fast code completion
- codellama:34b (19GB) - Meta's code model
- gov-rfp-model (4.4GB) - Government RFP analysis
- bfrfp-llm_platform_model (7.4GB) - Platform custom model
API Endpoints:
POST /invoke # MCP tool invocation GET /tools # List available tools/models POST /api/generate # Direct inference GET /api/tags # List installed models GET /health # Health check
Cost Savings: 96-99% reduction vs. Claude/OpenAI APIs
- Before: $500-2000/month (API costs)
- After: $20/month (Cursor Pro only)
OpenAPI Spec: /technical-guide/openapi/agent-protocol/local-models-mcp-api.yaml
4. Gateway & Routing (2 APIs)
Unified API gateway and intelligent request routing.
| Service | Port | Domain | Description |
|---|---|---|---|
| LLM Gateway | 3001 | gateway.local.bluefly.io | Unified LLM Gateway (Agent Router) |
| MCP Registry | 3000 | mcp-registry.local.bluefly.io | MCP connection registry & load balancing |
Gateway Capabilities:
- Multi-provider routing (OpenAI, Anthropic, Google, Cohere, Ollama)
- Intelligent load balancing
- Response caching
- Rate limiting (1000 req/min per key)
- Cost optimization routing
- Automatic failover
Supported Providers:
- OpenAI (GPT-4, GPT-3.5, DALL-E, Whisper)
- Anthropic (Claude 3, Claude 2)
- Google (Gemini Pro, PaLM)
- Cohere (Command, Embed)
- Ollama (26 local models)
- Custom models (GovRFP, Agent BuildKit)
5. Drupal Module APIs (30+ Endpoints)
REST APIs for Drupal custom modules.
| Module | Description | Key Endpoints |
|---|---|---|
| AI Agent Orchestra | GitLab ML integration, customer health | /api/ingest, /api/predict, /api/recommendations |
| AI Agents | OSSA v0.1.9 agent management | /api/agents, /api/execute, /api/webhooks |
| API Normalizer | Multi-provider API normalization | /api/normalize, /api/schema/import, /api/batch |
| MCP Registry | MCP server registry & health | /api/mcp/servers, /api/mcp/bundles, /api/mcp/health |
Base URL: http://drupal.local.bluefly.io:8080
OpenAPI Specs: /technical-guide/openapi/drupal-modules/
6. Build Tools & Infrastructure (8 APIs)
Development tooling and infrastructure management.
| Service | Port | Description |
|---|---|---|
| Git Policy | 9002 | Git policy enforcement (blocks shell scripts) |
| Markdown Migration | 9001 | GitLab wiki migration (5,298 files) |
| GitLab Integration | 9010 | Issue & wiki curation service |
| Ecosystem Management | 9000 | Self-evolving ecosystem (OSSA v0.1.9) |
| Agent BuildKit | - | Main agent development CLI |
| OSSA Framework | - | Open Standards for Scalable Agents |
API Integration Patterns
MCP Routing Pattern
How services call local models through MCP protocol:
graph LR A[Service] --> B[Agent Protocol MCP Coordinator] B --> C[Ollama MCP Tool Server] C --> D[Select Best Model] D --> E[Return Result $0 cost]
Example Request:
POST /invoke { "tool": "generate_code", "input": { "prompt": "Create TypeScript service", "model": "qwen2.5-coder:32b" } }
Response:
{ "result": { "output": "...", "model": "qwen2.5-coder:32b" }, "cost": 0, "duration": 2.5 }
Authentication
All APIs use unified authentication via Compliance Engine:
- JWT Tokens: Bearer authentication for service-to-service
- OAuth 2.0: GitLab OAuth for user authentication
- API Keys: For external integrations
- Rate Limiting: Configurable per service
See Authentication Guide for details.
OpenAPI Registry
Master Registry: /technical-guide/openapi/master-api-registry.yaml
All OpenAPI specs are:
- OpenAPI 3.1 compliant
- Type-safe with generated TypeScript types
- Validated with Redocly
- Synchronized with live services
- Published to GitLab wiki
API Versioning
All APIs follow semantic versioning:
- v1.x.x: Production stable
- v0.x.x: Development/beta
- Breaking changes: Major version bump required
Rate Limits
Default rate limits per service:
| Service Type | Requests/Min | Burst |
|---|---|---|
| Gateway | 1000 | 1500 |
| Agent Services | 500 | 750 |
| Engines | 200 | 300 |
| Drupal APIs | 100 | 150 |
Rate limits configurable via environment variables.
Health Monitoring
All services expose standardized health endpoints:
GET /health
Response:
{ "status": "healthy", "version": "0.4.9", "uptime": 86400, "dependencies": { "database": "healthy", "redis": "healthy", "mcp": "healthy" } }