Same GPU. Zero waste. Full power. GPU broker plus local control plane for modes, gates, and workstation fixes.
Mesma GPU. Zero desperdicio. Poder total. Broker de GPU com plano de controle local para modos, gates e fixes.
同一块GPU,零浪费,全部性能。74KB Python,零依赖。
Single daemon, stdlib-only Python, zero external dependencies. It arbitrates VRAM and also exposes modes, access gates, SSE events, and a 100-problem workstation catalog.
Daemon unico, Python stdlib puro, zero dependencias externas. Arbitra VRAM e tambem expõe modos, access gates, SSE e um catalogo de 100 problemas da workstation.
Priority-based leases with TTL. No workload exceeds the GPU budget. Automatic expiration of idle leases.
Leases baseados em prioridade com TTL. Nenhum workload excede o budget da GPU. Expiracao automatica de leases ociosos.
Adaptive throttling that scales with GPU contention. Prevents model load/unload storms that kill throughput.
Throttling adaptativo que escala com contencao da GPU. Evita tempestades de load/unload que destroem throughput.
The /cofit endpoint tells you which models can run in parallel, fitting within VRAM constraints.
O endpoint /cofit diz quais modelos rodam em paralelo, encaixando nos limites de VRAM.
Computes optimal sequential and parallel schedules for multi-model pipelines. Maximum GPU utilization.
Calcula schedules otimos sequenciais e paralelos para pipelines multi-modelo. Utilizacao maxima da GPU.
Named gates like browser-auth, meeting, and deploy can be claimed and released by local tabs and tools.
Gates nomeados como browser-auth, meeting e deploy podem ser reivindicados e liberados por abas e ferramentas locais.
/api/events streams mode and gate changes live. /sdk/gates.js lets browser tabs subscribe without inventing their own state store.
/api/events transmite mudancas de modo e gates ao vivo. /sdk/gates.js permite que abas do navegador assinem isso sem inventar outro estado.
A 100-problem catalog now ships with 100 implemented diagnostics for ports, services, Docker, git hygiene, browser groups, memory pressure, and more.
Um catalogo de 100 problemas agora embarca com 100 diagnosticos implementados para portas, servicos, Docker, git, grupos de browser, pressao de memoria e mais.
Register remote Corino-Gate nodes and aggregate health, mode, gates, and fix coverage into one control surface.
Cadastre nos remotos do Corino-Gate e agregue health, modo, gates e cobertura de fixes em uma unica superficie de controle.
Sensitive Hydra actions can require two fresh proofs: platform passkey plus hardware security key, both bound to method, path, and body hash.
Acoes sensiveis do Hydra podem exigir duas provas frescas: passkey de plataforma mais security key fisica, ambas vinculadas a metodo, rota e hash do corpo.
Three-level system: warn, pressure, emergency. Manages OOM scores. Prevents system-wide crashes from runaway processes.
Sistema de tres niveis: alerta, pressao, emergencia. Gerencia OOM scores. Previne crashes do sistema por processos descontrolados.
Detects unmanaged GPU processes consuming VRAM outside the broker. Full visibility into rogue workloads.
Detecta processos GPU nao gerenciados consumindo VRAM fora do broker. Visibilidade total de workloads nao autorizados.
Unused leases are auto-demoted. No resource hoarding. GPU time goes to workloads that actually need it.
Leases ociosos sao automaticamente rebaixados. Sem acumulacao de recursos. Tempo de GPU vai pra quem precisa.
Monitors GPU memory fragmentation. Alerts when allocation patterns degrade performance below SLO thresholds.
Monitora fragmentacao de memoria GPU. Alerta quando padroes de alocacao degradam performance abaixo dos limiares SLO.
Drains active requests before evicting. No killed inferences. Graceful handoff between priority levels.
Drena requests ativos antes de despejar. Nenhuma inferencia morta. Handoff gracioso entre niveis de prioridade.
After crashes, starts in conservative mode. Reduced allocations until stability is confirmed. Self-healing.
Apos crashes, inicia em modo conservador. Alocacoes reduzidas ate estabilidade confirmada. Auto-recuperacao.
Change policy.json and POST /reload. No restarts. Zero downtime configuration.
Altere policy.json e POST /reload. Sem restarts. Configuracao sem downtime.
Per-priority-class latency targets. The broker enforces response time contracts across competing workloads.
Alvos de latencia por classe de prioridade. O broker aplica contratos de tempo de resposta entre workloads concorrentes.
All endpoints accept and return JSON. The daemon binds to 127.0.0.1 by default.
Todos endpoints aceitam e retornam JSON. O daemon escuta em 127.0.0.1 por padrao.
| Method | Path | Description | Descricao |
|---|---|---|---|
| POST | /acquire | Request a VRAM lease | Solicitar um lease VRAM |
| POST | /release | Free a lease | Liberar um lease |
| POST | /renew | Extend lease TTL | Estender TTL do lease |
| POST | /cofit | Co-residency advisor | Consultor de co-residencia |
| POST | /priority | Set daily focus priority | Definir prioridade do dia |
| POST | /reload | Hot-reload policy.json | Recarregar policy.json |
| POST | /api/mode | Change control-plane mode | Alterar modo do plano de controle |
| GET | /api/gates | Access-gate snapshot | Snapshot dos access gates |
| POST | /api/gates/claim | Claim named gate | Reivindicar gate nomeado |
| POST | /api/gates/release | Release named gate | Liberar gate nomeado |
| GET | /api/events | SSE stream for local tabs | Stream SSE para abas locais |
| GET | /sdk/gates.js | Browser SDK for tabs | SDK browser para abas |
| GET | /api/fixes | 100-problem fix catalog | Catalogo de 100 problemas |
| POST | /api/fixes/run | Run implemented diagnostic or fix | Executar diagnostico ou fix implementado |
| GET | /api/fleet | Fleet snapshot | Snapshot da fleet |
| GET | /api/fleet/agent/status | Sanitized local node status | Status sanitizado do no local |
| POST | /api/fleet/nodes | Add or update fleet node | Adicionar ou atualizar no da fleet |
| POST | /api/fleet/nodes/remove | Remove fleet node | Remover no da fleet |
| GET | /ui | Local dashboard and control plane | Painel local e plano de controle |
| GET | /status | Full broker snapshot | Snapshot completo do broker |
| GET | /leases | Active leases | Leases ativos |
| GET | /ledger | GPU state ledger | Estado da GPU |
| GET | /health | Health check | Verificacao de saude |
A single daemon mediates all GPU access. Clients acquire leases before touching VRAM.
Um unico daemon intermedia todo acesso a GPU. Clientes adquirem leases antes de tocar a VRAM.
Run the installer and the first-run settings screen opens automatically.
Rode o instalador e a tela de definicoes abre automaticamente na primeira execucao.
# Extract and install
unzip corino-gate-linux.zip
cd corino-gate-linux
./install.sh
The app can launch the daemon for you, or you can start it manually.
A app pode iniciar o daemon para voce, ou voce pode subir manualmente.
corino-gate --first-run corino-gate-daemon &
CLI for humans, Python client for scripts, REST API for everything else.
CLI para humanos, cliente Python para scripts, API REST para todo o resto.
# CLI
gate status
gate cofit
from gate_client import gate_lease with gate_lease(tag="my_job", vram_mib=15000) as lease: run_inference()
Download Corino-Gate. Zero dependencies. One file. Full control.
Baixe o Corino-Gate. Zero dependencias. Um arquivo. Controle total.
Download Linux/WSL installer Baixar instalador Linux/WSL Windows beta installer Instalador Windows beta