API Catalog

Separation of Duties: API documentation is responsible for documenting APIs. It does NOT own agent manifests, execution, or infrastructure configuration. See Separation of Duties for details.

Complete listing of all APIs across the Bluefly LLM Platform ecosystem.

Overview

Total APIs: 240+ services across 6 categories
Architecture: OpenAPI-First, RESTful, gRPC where applicable
Documentation: All services maintain OpenAPI 3.1 specifications
Authentication: Unified JWT/OAuth 2.0 with GitLab integration

API Categories

1. Agent Services (12 APIs)

Core agent communication, coordination, and intelligence services.

Service	Port	Domain	Description
Agent Router	3001	`gateway.local.bluefly.io`	LLM Gateway & API orchestration
Agent Protocol	3050	`protocol.local.bluefly.io`	MCP coordinator for LLM ecosystem
Agent Mesh	3040	`mesh.local.bluefly.io`	Multi-agent communication mesh with gRPC
Agent Brain	3010	`agent-brain.local.bluefly.io`	Cognitive processing & reasoning
Agent Tracer	3100	`tracer.local.bluefly.io`	Operations intelligence & tracing
Agent Chat	3020	`chat.local.bluefly.io`	Multi-agent chat with LibreChat
Agent Docker	3030	`docker.local.bluefly.io`	Container intelligence & management
Agent Studio	3060	`studio.local.bluefly.io`	AI agent development IDE
Agent Ops	3002	`ops.local.bluefly.io`	DevOps operations management

OpenAPI Specs:

2. Workflow Engines (4 APIs)

Workflow orchestration, compliance, and automation engines.

Service	Port	Domain	Description
Workflow Engine	3090	`workflow.local.bluefly.io`	Langflow-compatible visual workflows
Compliance Engine	3080	`compliance.local.bluefly.io`	FedRAMP, NIST, HIPAA, GDPR compliance
Doc Engine	3018	`docs.local.bluefly.io`	Documentation generation & management
Agentic Flows	3070	`flows.local.bluefly.io`	Flow orchestration & coordination

OpenAPI Specs:

Key Features:

Visual workflow builder (Langflow integration)
Real-time compliance monitoring
FedRAMP Moderate baseline support
NIST AI RMF 1.0 implementation
Automated audit logging

3. Local Models (26 FREE Models)

Zero-cost local inference via Ollama MCP integration.

Provider	Endpoint	Cost	Models Available
Ollama MCP	`http://localhost:11435`	$0/month	26 models

Featured Models:

qwen2.5-coder:32b (19GB) - Best for code generation
gpt-oss:120b (65GB) - General purpose reasoning
deepseek-coder-v2:16b (9GB) - Fast code completion
codellama:34b (19GB) - Meta's code model
gov-rfp-model (4.4GB) - Government RFP analysis
bfrfp-llm_platform_model (7.4GB) - Platform custom model

API Endpoints:

POST   /invoke           # MCP tool invocation
GET    /tools            # List available tools/models
POST   /api/generate     # Direct inference
GET    /api/tags         # List installed models
GET    /health           # Health check

Cost Savings: 96-99% reduction vs. Claude/OpenAI APIs

Before: $500-2000/month (API costs)
After: $20/month (Cursor Pro only)
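
The 96-99% range follows from the two monthly figures quoted above. A quick check, using only those numbers:

```python
# Monthly cost figures quoted above (USD).
BEFORE_LOW, BEFORE_HIGH = 500, 2000  # hosted API spend range
AFTER = 20                           # Cursor Pro subscription only


def reduction_percent(before: float, after: float) -> float:
    """Percentage cost reduction when moving from `before` to `after`."""
    return (before - after) / before * 100


low = reduction_percent(BEFORE_LOW, AFTER)
high = reduction_percent(BEFORE_HIGH, AFTER)
print(f"{low:.0f}% to {high:.0f}% reduction")
```

The lower bound corresponds to the $500/month baseline and the upper bound to the $2000/month baseline.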

OpenAPI Spec: /technical-guide/openapi/agent-protocol/local-models-mcp-api.yaml

4. Gateway & Routing (2 APIs)

Unified API gateway and intelligent request routing.

Service	Port	Domain	Description
LLM Gateway	3001	`gateway.local.bluefly.io`	Unified LLM Gateway (Agent Router)
MCP Registry	3000	`mcp-registry.local.bluefly.io`	MCP connection registry & load balancing

Gateway Capabilities:

Multi-provider routing (OpenAI, Anthropic, Google, Cohere, Ollama)
Intelligent load balancing
Response caching
Rate limiting (1000 req/min per key)
Cost optimization routing
Automatic failover

Supported Providers:

OpenAI (GPT-4, GPT-3.5, DALL-E, Whisper)
Anthropic (Claude 3, Claude 2)
Google (Gemini Pro, PaLM)
Cohere (Command, Embed)
Ollama (26 local models)
Custom models (GovRFP, Agent BuildKit)

5. Drupal Module APIs (30+ Endpoints)

REST APIs for Drupal custom modules.

Module	Description	Key Endpoints
AI Agent Orchestra	GitLab ML integration, customer health	`/api/ingest`, `/api/predict`, `/api/recommendations`
AI Agents	OSSA v0.1.9 agent management	`/api/agents`, `/api/execute`, `/api/webhooks`
API Normalizer	Multi-provider API normalization	`/api/normalize`, `/api/schema/import`, `/api/batch`
MCP Registry	MCP server registry & health	`/api/mcp/servers`, `/api/mcp/bundles`, `/api/mcp/health`

Base URL: http://drupal.local.bluefly.io:8080

OpenAPI Specs: /technical-guide/openapi/drupal-modules/

6. Build Tools & Infrastructure (8 APIs)

Development tooling and infrastructure management.

Service	Port	Description
Git Policy	9002	Git policy enforcement (blocks shell scripts)
Markdown Migration	9001	GitLab wiki migration (5,298 files)
GitLab Integration	9010	Issue & wiki curation service
Ecosystem Management	9000	Self-evolving ecosystem (OSSA v0.1.9)
Agent BuildKit	-	Main agent development CLI
OSSA Framework	-	Open Standards for Scalable Agents

API Integration Patterns

MCP Routing Pattern

How services call local models through the MCP protocol:

graph LR
    A[Service] --> B[Agent Protocol MCP Coordinator]
    B --> C[Ollama MCP Tool Server]
    C --> D[Select Best Model]
    D --> E[Return Result $0 cost]

Example Request:

POST /invoke
{
  "tool": "generate_code",
  "input": {
    "prompt": "Create TypeScript service",
    "model": "qwen2.5-coder:32b"
  }
}

Response:

{
  "result": {
    "output": "...",
    "model": "qwen2.5-coder:32b"
  },
  "cost": 0,
  "duration": 2.5
}
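
The request and response above could be handled from a client roughly as follows. This is a minimal sketch, not the platform's client library: the helper names are hypothetical, and it assumes `/invoke` is served at the Ollama MCP endpoint listed under Local Models and that responses always have the shape shown in the example.

```python
import json
import urllib.request
from typing import Any

# Assumption: /invoke is reachable at the Ollama MCP endpoint from the
# Local Models table. Adjust if the route is exposed elsewhere.
MCP_BASE_URL = "http://localhost:11435"


def build_invoke_request(tool: str, prompt: str, model: str) -> dict[str, Any]:
    """Build the JSON body for POST /invoke, mirroring the example request."""
    return {"tool": tool, "input": {"prompt": prompt, "model": model}}


def parse_invoke_response(raw: str) -> tuple[str, str, float]:
    """Return (output, model, duration) from a response shaped like the example."""
    body = json.loads(raw)
    result = body["result"]
    return result["output"], result["model"], float(body["duration"])


def send_invoke(payload: dict[str, Any], base_url: str = MCP_BASE_URL) -> str:
    """POST the payload to /invoke and return the raw response body."""
    request = urllib.request.Request(
        f"{base_url}/invoke",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")
```

A caller would chain the three helpers, for example `parse_invoke_response(send_invoke(build_invoke_request("generate_code", "Create TypeScript service", "qwen2.5-coder:32b")))`. Keeping request building and response parsing separate from the network call makes both easy to test without a running service.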

Authentication

All APIs use unified authentication via the Compliance Engine:

JWT Tokens: Bearer authentication for service-to-service calls
OAuth 2.0: GitLab OAuth for user authentication
API Keys: For external integrations
Rate Limiting: Configurable per service

See Authentication Guide for details.
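
As an illustration of the credential types listed above, a client might build its request headers like this. The `Authorization: Bearer` form is the standard way to send a JWT or OAuth 2.0 access token; the `X-API-Key` header name is an assumption made for this sketch and is not confirmed by this catalog, so check the Authentication Guide for the actual name.

```python
# Hypothetical header name for API keys; not taken from the platform docs.
API_KEY_HEADER = "X-API-Key"


def bearer_headers(token: str) -> dict[str, str]:
    """Headers for a JWT or OAuth 2.0 access token (standard Bearer scheme)."""
    if not token:
        raise ValueError("token must not be empty")
    return {"Authorization": f"Bearer {token}", "Accept": "application/json"}


def api_key_headers(api_key: str) -> dict[str, str]:
    """Headers for an external integration using an API key (assumed header)."""
    if not api_key:
        raise ValueError("api_key must not be empty")
    return {API_KEY_HEADER: api_key, "Accept": "application/json"}
```

Rejecting empty credentials early avoids sending a malformed `Authorization` header that the service would answer with a less specific error.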

OpenAPI Registry

Master Registry: /technical-guide/openapi/master-api-registry.yaml

All OpenAPI specs are:

OpenAPI 3.1 compliant
Type-safe with generated TypeScript types
Validated with Redocly
Synchronized with live services
Published to GitLab wiki

API Versioning

All APIs follow semantic versioning:

v1.x.x: Production stable
v0.x.x: Development/beta
Breaking changes: Major version bump required
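
The versioning rules above can be expressed as a small check. This is a sketch with hypothetical helper names; it assumes versions are plain `major.minor.patch` strings with an optional leading `v` and no pre-release suffix.

```python
from typing import NamedTuple


class Version(NamedTuple):
    major: int
    minor: int
    patch: int


def parse_version(text: str) -> Version:
    """Parse 'v1.2.3' or '1.2.3' into a Version tuple."""
    if text.startswith("v"):
        text = text[1:]
    major, minor, patch = (int(part) for part in text.split("."))
    return Version(major, minor, patch)


def stability(version: Version) -> str:
    """v1.x.x and later are production stable; v0.x.x is development/beta."""
    return "production stable" if version.major >= 1 else "development/beta"


def is_breaking_upgrade(old: Version, new: Version) -> bool:
    """Breaking changes require a major version bump."""
    return new.major > old.major
```

For example, `stability(parse_version("0.4.9"))` returns `"development/beta"`, and moving from `v1.4.0` to `v2.0.0` is flagged by `is_breaking_upgrade`.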

Rate Limits

Default rate limits per service:

Service Type	Requests/Min	Burst
Gateway	1000	1500
Agent Services	500	750
Engines	200	300
Drupal APIs	100	150

Rate limits are configurable via environment variables.
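
One common way to enforce a sustained rate with a burst allowance is a token bucket. The sketch below uses the defaults from the table and assumes that "Burst" is the bucket capacity and "Requests/Min" is the refill rate; the catalog does not state which algorithm the services actually use, and the dictionary keys are hypothetical.

```python
from dataclasses import dataclass, field

# Defaults copied from the table above: (requests per minute, burst).
DEFAULT_LIMITS = {
    "gateway": (1000, 1500),
    "agent_services": (500, 750),
    "engines": (200, 300),
    "drupal_apis": (100, 150),
}


@dataclass
class TokenBucket:
    requests_per_min: int
    burst: int
    tokens: float = field(init=False)
    last_refill: float = 0.0

    def __post_init__(self) -> None:
        # Start full so a client can use the whole burst immediately.
        self.tokens = float(self.burst)

    def allow(self, now: float) -> bool:
        """Return True if a request at time `now` (seconds) is within limits."""
        elapsed = max(0.0, now - self.last_refill)
        refill = elapsed * self.requests_per_min / 60.0
        self.tokens = min(float(self.burst), self.tokens + refill)
        self.last_refill = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False
```

Passing the clock value in explicitly keeps the bucket deterministic in tests; in a running service, `time.monotonic()` would be the usual source for `now`.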

Health Monitoring

All services expose standardized health endpoints:

GET /health

Response:

{
  "status": "healthy",
  "version": "0.4.9",
  "uptime": 86400,
  "dependencies": {
    "database": "healthy",
    "redis": "healthy",
    "mcp": "healthy"
  }
}
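
A monitoring script could interpret this response along the following lines. This is a sketch with hypothetical function names; it assumes every service returns the fields shown above and treats any status other than `"healthy"` as unhealthy, since the catalog does not list other status values.

```python
import json


def unhealthy_dependencies(raw: str) -> list[str]:
    """Names of dependencies whose status is anything other than 'healthy'."""
    body = json.loads(raw)
    dependencies = body.get("dependencies", {})
    return sorted(
        name for name, status in dependencies.items() if status != "healthy"
    )


def is_healthy(raw: str) -> bool:
    """True only if the service and all of its dependencies report healthy."""
    body = json.loads(raw)
    return body.get("status") == "healthy" and not unhealthy_dependencies(raw)
```

Checking dependencies separately from the top-level `status` catches the case where a service reports itself healthy while one of its backing stores is not.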
