Commit graph

36 commits

Author SHA1 Message Date
bd09761def
nightlight: add durable grpc runtime 2026-03-31 21:29:14 +09:00
9dfe86f92a
fix cluster resiliency gaps across VM watch, runtime health, and FlareDB routing 2026-03-31 14:09:45 +09:00
1698009062
Simplify DNS publication state and FlashDNS storage 2026-03-31 11:14:18 +09:00
aba1b73d5b
Tighten cluster builds and add record-set DNS publication 2026-03-31 10:03:40 +09:00
b75766af0b
Implement IAM tenant registry and privileged admin surfaces 2026-03-31 01:23:16 +09:00
37f5479ab8
Add daemon scheduling for native services
Some checks failed
Nix CI / filter (push) Failing after 1s
Nix CI / gate () (push) Has been skipped
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-30 21:31:32 +09:00
ce979d8f26
fiberlb: add BGP interop, drain, and policy validation
Some checks failed
Nix CI / filter (push) Successful in 6s
Nix CI / gate () (push) Failing after 1s
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-30 20:06:08 +09:00
67d4523adf
Strengthen FiberLB multi-peer BGP verification
Some checks failed
Nix CI / filter (push) Failing after 1s
Nix CI / gate () (push) Has been skipped
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-30 17:41:24 +09:00
63c7251756
Harden FiberLB native BGP control plane
Some checks failed
Nix CI / filter (push) Failing after 1s
Nix CI / gate () (push) Has been skipped
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-30 16:46:35 +09:00
ce4bab07d6
fiberlb: add native BGP speaker and VM smoke test
Some checks failed
Nix CI / filter (push) Failing after 1s
Nix CI / gate () (push) Has been skipped
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-30 16:13:14 +09:00
96d46a3603
Integrate topology-driven bootstrap into nix-nos
Some checks failed
Nix CI / filter (push) Successful in 8s
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / gate () (push) Failing after 5s
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-30 14:39:28 +09:00
795b8ad70c
Add prebuilt system closure support for host rollouts
Some checks failed
Nix CI / filter (push) Successful in 7s
Nix CI / gate () (push) Failing after 1s
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-30 13:54:14 +09:00
d6d96b8c37
Move native runtime seed state into declarative Nix
Some checks failed
Nix CI / filter (push) Successful in 6s
Nix CI / gate () (push) Failing after 1s
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-28 07:38:45 +09:00
9d21e2da95
Add PrismNet-backed PlasmaVMC matrix coverage
Some checks failed
Nix CI / filter (push) Failing after 1s
Nix CI / gate () (push) Has been skipped
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1s
2026-03-28 03:14:11 +09:00
e1a5d394e5
ci: unify workspace inventory and harden tier0 gating
Some checks failed
Nix CI / filter (push) Successful in 54s
Nix CI / gate (shared crates) (push) Has been skipped
Nix CI / gate () (push) Failing after 6s
Nix CI / build () (push) Has been skipped
Nix CI / ci-status (push) Failing after 1m14s
2026-03-28 00:09:22 +09:00
290c6ba88a
Expand gateway matrix coverage and fix test-cluster routing 2026-03-27 22:51:37 +09:00
6fa172eab1
Implement host lifecycle orchestration and distributed storage restructuring 2026-03-27 12:14:12 +09:00
a7d5cfa738
Wire bootstrap installers to explicit targets and reboot-aware rollout 2026-03-21 16:43:00 +09:00
88e78d1602
Add commissioning facts and PXE bootstrap wiring 2026-03-21 16:42:33 +09:00
fbcbb4e5dc
Add bootstrap flake bundle delivery and Nix desired-system generation 2026-03-20 17:43:26 +09:00
edd2442267
Add desired-system state and health-gated nix-agent rollback 2026-03-20 17:09:59 +09:00
de60f087d6
Add nix-agent and close NixOS reconcile loop 2026-03-20 16:56:59 +09:00
203f21ff29
Factor vm-cluster cluster state and export deployer artifacts 2026-03-20 16:29:34 +09:00
ed0f9f42f4
WIP snapshot: preserve dirty worktree 2026-03-20 16:25:11 +09:00
1b9ed0cd22
ci: optimize workflow with paths-filter and workspace-aware gates 2025-12-24 18:22:22 +09:00
d3d74995e8
chore: initial sync of untracked files and infrastructure components 2025-12-24 18:21:55 +09:00
4bfe75a1d7 feat(flaredb): Add --http-addr CLI flag and region peer management API
- Add --http-addr CLI flag for HTTP REST bind address
- Fix config env var parsing (FLAREDB_HTTP_ADDR wasn't working due to separator conflict)
- Add GET /api/v1/regions/{id} endpoint to view region info
- Add POST /api/v1/regions/{id}/add_peer endpoint for multi-peer region management
- Update NixOS module to use --http-addr 0.0.0.0 CLI flag instead of env var

This enables FlareDB region cluster formation with multiple peers.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-19 16:54:06 +09:00
9a72c8d3ec fix(nix): Bind REST APIs to 0.0.0.0 for cluster join
- chainfire.nix: CHAINFIRE__NETWORK__HTTP_ADDR env var
- flaredb.nix: FLAREDB_HTTP_ADDR env var
- first-boot-automation.nix: jq-based config reading

Fixes ChainFire crash: "unexpected argument '--http-addr' found"

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-19 08:14:44 +09:00
5586929e98 fix(nix): Add creditservice.enable + fix CLI args
- Add services.creditservice.enable = true to all node configs
- Add firewall port 3010 (gRPC) for creditservice
- Fix creditservice.nix CLI: --listen-addr/--http-addr (not --port/--data-dir)
- Add CREDITSERVICE_CHAINFIRE_ENDPOINT environment variable
- Updated S4 test script to expect 11 services (was 10)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-19 00:17:48 +09:00
54e3a16091 fix(nix): Align service ExecStart with actual binary CLI interfaces
- chainfire: Fix binary name (chainfire-server → chainfire)
- fiberlb: Use --grpc-addr instead of --port
- flaredb: Use --addr instead of --api-addr/--raft-addr
- flashdns: Add --grpc-addr and --dns-addr flags
- iam: Use --addr instead of --port/--data-dir
- k8shost: Add --iam-server-addr for dynamic IAM port connection
- lightningstor: Add --in-memory-metadata for ChainFire fallback
- plasmavmc: Add ChainFire service dependency and endpoint env var
- prismnet: Use --grpc-addr instead of --port

These fixes are required for T039 production deployment. The
plasmavmc change specifically fixes the ChainFire port mismatch
(was hardcoded 50051, now uses chainfire.port = 2379).

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-18 22:58:40 +09:00
3eeb303dcb feat: Batch commit for T039.S3 deployment
Includes all pending changes needed for nixos-anywhere:
- fiberlb: L7 policy, rule, certificate types
- deployer: New service for cluster management
- nix-nos: Generic network modules
- Various service updates and fixes

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-13 04:34:51 +09:00
8317b22b9e fix(nix): Remove deprecated max_retries from Prometheus config
- Remove queue_config.max_retries option from observability.nix
- Option deprecated/removed in recent NixOS/Prometheus versions
- Found by nix eval audit (T039.S3 pre-deployment validation)

Error: services.prometheus.remoteWrite."[...]".queue_config.max_retries' does not exist

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2025-12-12 08:49:30 +09:00
4547dacc7e feat(nix): Add creditservice module for NixOS deployment
- Add creditservice.nix module for credit service deployment
- Update default.nix to import creditservice module
- Required for T039.S3 NixOS provisioning

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2025-12-12 08:35:20 +09:00
d2149b6249 fix(lightningstor): Fix SigV4 canonicalization for AWS S3 auth
- Replace form_urlencoded with RFC 3986 compliant URI encoding
- Implement aws_uri_encode() matching AWS SigV4 spec exactly
- Unreserved chars (A-Z,a-z,0-9,-,_,.,~) not encoded
- All other chars percent-encoded with uppercase hex
- Preserve slashes in paths, encode in query params
- Normalize empty paths to '/' per AWS spec
- Fix test expectations (body hash, HMAC values)
- Add comprehensive SigV4 signature determinism test

This fixes the canonicalization mismatch that caused signature
validation failures in T047. Auth can now be enabled for production.

Refs: T058.S1
2025-12-12 06:23:46 +09:00
5c6eb04a46 T036: Add VM cluster deployment configs for nixos-anywhere
- netboot-base.nix with SSH key auth
- Launch scripts for node01/02/03
- Node configuration.nix and disko.nix
- Nix modules for first-boot automation

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2025-12-11 09:59:19 +09:00
baa3e038f9 Add NixOS service modules to git tracking
The nix/modules directory was untracked, causing flake evaluation to fail
when referencing ./nix/modules. This adds 9 service module definitions
created during T024 NixOS packaging.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-09 17:34:41 +09:00