Developer Guide¶

This guide provides comprehensive information for developers and administrators working with the AI Engineering Platform infrastructure.

Overview¶

The AI Engineering Platform consists of multiple components that work together to provide secure, isolated development environments and automated participant management. This guide covers deployment, configuration, and maintenance procedures.

Platform Components¶

1. Coder Server¶

Purpose: Provides containerized development environments
Deployment: GCP VM with Terraform
Documentation: See Coder Deployment

2. Participant Onboarding System¶

Purpose: Automated participant authentication and API key distribution
Components: Firebase Authentication, Firestore, Cloud Functions
Documentation: See Participant Onboarding

3. Onboarding Status Dashboard¶

Purpose: Real-time monitoring of participant onboarding status
Deployment: Next.js on Cloud Run with Load Balancer path-based routing
Access: https://platform.vectorinstitute.ai/onboarding

Infrastructure Deployment¶

Coder Server Deployment¶

Follow the comprehensive deployment guide in the coder/deploy/ directory.

Quick Start:

cd coder/deploy
terraform init
terraform plan
terraform apply

For detailed instructions, see coder/deploy/README.md.

Onboarding Status Web Dashboard¶

The onboarding status dashboard is deployed on Cloud Run and integrated with the main platform load balancer using path-based routing.

Setup Guide: Onboarding Status Web - Load Balancer Setup

This guide covers:

Configuring Next.js with basePath for path-based routing
Creating serverless Network Endpoint Groups (NEG)
Setting up backend services for Cloud Run
Configuring load balancer path matchers
Deployment and verification procedures
Troubleshooting common issues

Automated Deployment (Recommended):

The service is automatically deployed via GitHub Actions when changes are pushed to the main branch. The workflow:

Builds and tests the Docker container
Pushes to Google Artifact Registry
Deploys to Cloud Run
Verifies health checks
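
The workflow stages can be pictured as a short sequence of commands. The following is a dry-run sketch only: it prints the commands rather than running them, and the region, the Artifact Registry repository name, and the image tag are hypothetical values that do not come from this guide. The workflow file in the repository remains the authoritative definition, including whatever test step it runs.

```shell
# Dry-run sketch of the deployment stages. REGION and REPOSITORY are
# assumptions; replace them with the values used by the real workflow.
PROJECT_ID="${GCP_PROJECT_ID:-coderd}"
REGION="${REGION:-us-central1}"          # assumption
REPOSITORY="${REPOSITORY:-onboarding}"   # assumption (hypothetical repository name)
SERVICE="onboarding-status-web"

# Artifact Registry image names follow
# LOCATION-docker.pkg.dev/PROJECT/REPOSITORY/IMAGE:TAG.
image_uri() {
  local tag="$1"
  printf '%s-docker.pkg.dev/%s/%s/%s:%s\n' \
    "$REGION" "$PROJECT_ID" "$REPOSITORY" "$SERVICE" "$tag"
}

# Print one command per stage instead of executing anything.
plan_deploy() {
  local tag="$1" image
  image="$(image_uri "$tag")"
  echo "docker build -t $image ."
  echo "docker push $image"
  echo "gcloud run deploy $SERVICE --image=$image --region=$REGION --project=$PROJECT_ID"
  echo "curl -fsS -o /dev/null https://platform.vectorinstitute.ai/onboarding"
}
```

Calling `plan_deploy "$(git rev-parse --short HEAD)"` prints the four commands for the current commit, which can be handy when comparing a manual run against what the workflow does.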

Manual Deployment:

./scripts/admin/deploy_onboarding_status_web.sh

Access URL:

https://platform.vectorinstitute.ai/onboarding

Required GitHub Secrets for Automated Deployment:

Configure these secrets in GitHub repository settings (Settings → Secrets and variables → Actions):

GCP_PROJECT_ID: Your GCP project ID (e.g., coderd)
WIF_PROVIDER: Workload Identity Federation provider
  Format: projects/PROJECT_NUMBER/locations/global/workloadIdentityPools/POOL_ID/providers/PROVIDER_ID
GCP_SERVICE_ACCOUNT: Service account email for deployment
  Format: SERVICE_ACCOUNT_NAME@PROJECT_ID.iam.gserviceaccount.com
  Required roles: roles/run.admin, roles/iam.serviceAccountUser, roles/artifactregistry.admin
GH_ORG_TOKEN: Personal access token with read:org scope for checking GitHub organization membership
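
Because a malformed secret tends to surface only as a failed workflow run, it can help to check the two formatted values before saving them. The helper names below are hypothetical and the patterns are deliberately loose approximations of the formats listed above, not an official validation. Domain-scoped project IDs, for example, would be rejected by the service account pattern.

```shell
# Succeeds if $1 looks like a Workload Identity Federation provider name:
# projects/PROJECT_NUMBER/locations/global/workloadIdentityPools/POOL_ID/providers/PROVIDER_ID
valid_wif_provider() {
  printf '%s\n' "$1" | grep -Eqx \
    'projects/[0-9]+/locations/global/workloadIdentityPools/[a-z0-9-]+/providers/[a-z0-9-]+'
}

# Succeeds if $1 looks like a service account email:
# SERVICE_ACCOUNT_NAME@PROJECT_ID.iam.gserviceaccount.com
valid_service_account() {
  printf '%s\n' "$1" | grep -Eqx \
    '[a-z][a-z0-9-]+@[a-z][a-z0-9-]+\.iam\.gserviceaccount\.com'
}
```

If the GitHub CLI is installed, a checked value can then be stored with `gh secret set WIF_PROVIDER --body "$value"`, for example `valid_wif_provider "$value" && gh secret set WIF_PROVIDER --body "$value"`.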

Service Architecture¶

Load Balancer Configuration¶

The platform uses a single Google Cloud Load Balancer to route traffic to multiple backend services:

platform.vectorinstitute.ai/
├── /                    → Coder Server (VM: coder-entrypoint)
├── /onboarding          → Cloud Run (onboarding-status-web)
└── /onboarding/*        → Cloud Run (onboarding-status-web)

Key Resources:

Resource	Name	Purpose
External IP	`coderd-https-lb-ip`	Static IP for load balancer
HTTPS Forwarding Rule	`coderd-https-forwarding-rule`	Routes HTTPS traffic
HTTPS Proxy	`coderd-https-proxy`	SSL termination
URL Map	`https-url-map`	Path-based routing configuration
Backend Service (Coder)	`coderd-backend`	Routes to Coder VM
Backend Service (Onboarding)	`onboarding-backend`	Routes to Cloud Run
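
The routing tree and backend names above can be mirrored in a small helper for reasoning about which backend should serve a given request path. This is an illustrative sketch, not the URL map itself: the function name is hypothetical, and it assumes the path rules are exactly `/onboarding` and `/onboarding/*` with everything else falling through to the default backend.

```shell
# Return the backend service expected to handle a request path.
# /onboarding and anything under /onboarding/ go to Cloud Run;
# every other path falls through to the Coder backend.
backend_for_path() {
  case "$1" in
    /onboarding|/onboarding/*) echo "onboarding-backend" ;;
    *)                         echo "coderd-backend" ;;
  esac
}
```

Note that under these rules a path such as `/onboarding-old` shares only a prefix with `/onboarding` and is therefore handled by `coderd-backend`. The live configuration can be compared against this with `gcloud compute url-maps describe https-url-map --project=coderd`.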

Firebase Services¶

The platform uses Firebase for authentication and data storage:

Firebase Authentication: Custom token generation for participants
Firestore: Participant data, team assignments, and API keys
Firebase Security Rules: Enforce team-level data isolation

Administration¶

Deploying a New Bootcamp¶

When setting up a new bootcamp with its own GCP project and service account, complete this checklist before running the Coder template for the first time.

1. Grant Cloud Run Invoker permission to the workspace service account¶

The workspace VMs authenticate to the Firebase token service using their GCP service account identity. If the service account lacks roles/run.invoker on the token service, the Cloud Run IAM layer rejects the request before it reaches the service and returns a non-JSON error response. The onboarding CLI then fails at Step 2 with a misleading JSON parse error.

gcloud run services add-iam-policy-binding firebase-token-service \
  --region=us-central1 \
  --project=coderd \
  --member="serviceAccount:<workspace-sa>@<project>.iam.gserviceaccount.com" \
  --role="roles/run.invoker"

Example (for the agentic-ai-evaluation-bootcamp project):

gcloud run services add-iam-policy-binding firebase-token-service \
  --region=us-central1 \
  --project=coderd \
  --member="serviceAccount:agentic-ai-evaluation-bootcamp@agentic-ai-evaluation-bootcamp.iam.gserviceaccount.com" \
  --role="roles/run.invoker"

Verify the policy:

gcloud run services get-iam-policy firebase-token-service \
  --region=us-central1 \
  --project=coderd
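
Reading the full policy by eye is error-prone when several members are bound. As a sketch, the check can be scripted by listing only the members bound to roles/run.invoker and testing for an exact match. The function names are hypothetical, and the `--flatten`, `--filter`, and `--format` combination is the common gcloud pattern for extracting members of one role; confirm the output on your gcloud version before relying on it.

```shell
# Succeeds if the member given as $1 appears on stdin, one member per line.
# -F treats the member as a literal string, -x requires a whole-line match.
has_member() {
  grep -Fxq -- "$1"
}

# Print the members bound to roles/run.invoker on the token service.
list_invokers() {
  gcloud run services get-iam-policy firebase-token-service \
    --region=us-central1 \
    --project=coderd \
    --flatten="bindings[].members" \
    --filter="bindings.role:roles/run.invoker" \
    --format="value(bindings.members)"
}
```

For example, `list_invokers | has_member "serviceAccount:<workspace-sa>@<project>.iam.gserviceaccount.com" && echo "invoker granted"` prints the message only when the binding exists.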

2. Register participants in Firestore¶

Participant data lives in the onboarding Firestore database of the coderd GCP project. Use the admin CLI to load participants before starting any workspaces:

onboard admin setup-participants config/participants.csv
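
Before loading the file, a quick structural check can catch truncated or ragged rows. The sketch below is column-agnostic because this guide does not specify the CSV layout: it only verifies that every row has as many comma-separated fields as the header. It assumes a simple CSV without quoted commas, and the function name is hypothetical.

```shell
# Report rows whose field count differs from the header row.
# Exits non-zero if any mismatch is found.
check_csv_shape() {
  awk -F',' '
    NR == 1 { expected = NF; next }
    NF != expected {
      printf "line %d: expected %d fields, found %d\n", NR, expected, NF
      bad = 1
    }
    END { exit (bad ? 1 : 0) }
  ' "$1"
}
```

Running `check_csv_shape config/participants.csv && onboard admin setup-participants config/participants.csv` then loads participants only when the file passes the check.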

3. Provision team API keys¶

onboard admin create-gemini-keys \
  --project <bootcamp-gcp-project> \
  --bootcamp <bootcamp-name>

Participant Management¶

Adding Participants¶

Use the admin scripts to add new participants:

python scripts/admin/setup_participants.py

Requirements:

CSV file with participant information
Firebase admin credentials
Team assignments

Viewing Onboarding Status¶

Command Line:

onboard --admin-status-report --gcp-project coderd

Web Dashboard:

https://platform.vectorinstitute.ai/onboarding

The dashboard provides:

Real-time participant status
Onboarding completion rates
Filtering by status
CSV export functionality

Monitoring and Maintenance¶

Health Checks¶

Coder Server:

curl -I https://platform.vectorinstitute.ai/

Onboarding Dashboard:

curl -I https://platform.vectorinstitute.ai/onboarding

Onboarding API:

curl https://platform.vectorinstitute.ai/onboarding/api/participants
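
The three checks above can be combined into one pass that prints a status line per endpoint. This is a minimal sketch with hypothetical function names. Treating any 2xx or 3xx response as healthy is an assumption, as is the 10-second timeout; if an endpoint requires authentication, a 401 or 403 will show up as "unhealthy" here and should be interpreted accordingly.

```shell
# Map an HTTP status code to a coarse health label.
classify_status() {
  case "$1" in
    2??|3??) echo "ok" ;;
    000)     echo "unreachable" ;;  # curl prints 000 when no response was received
    *)       echo "unhealthy" ;;
  esac
}

# Print only the HTTP status code for a URL.
http_status() {
  curl -s -o /dev/null --max-time 10 -w '%{http_code}' "$1" || true
}

# Check every platform endpoint and print "<label> <code> <url>" per line.
check_all() {
  local url code
  for url in \
    https://platform.vectorinstitute.ai/ \
    https://platform.vectorinstitute.ai/onboarding \
    https://platform.vectorinstitute.ai/onboarding/api/participants
  do
    code="$(http_status "$url")"
    printf '%s %s %s\n' "$(classify_status "$code")" "$code" "$url"
  done
}
```

Running `check_all` after a deployment gives a one-screen summary of all three endpoints.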

Log Access¶

Cloud Run Logs:

gcloud logging read "resource.type=cloud_run_revision AND resource.labels.service_name=onboarding-status-web" \
  --project=coderd \
  --limit=50 \
  --format=json
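
When hunting for failures, it is often more useful to narrow the same query by severity. The helper below is a sketch with a hypothetical name that builds the filter string used above and optionally appends a minimum severity, using the `severity>=LEVEL` comparison from the Cloud Logging query language.

```shell
# Build a Cloud Logging filter for one Cloud Run service.
# $1: service name; $2 (optional): minimum severity, for example ERROR.
cloud_run_log_filter() {
  local service="$1" min_severity="${2:-}" filter
  filter="resource.type=cloud_run_revision AND resource.labels.service_name=${service}"
  if [ -n "$min_severity" ]; then
    filter="${filter} AND severity>=${min_severity}"
  fi
  printf '%s\n' "$filter"
}
```

For example, `gcloud logging read "$(cloud_run_log_filter onboarding-status-web ERROR)" --project=coderd --limit=50 --freshness=1h` shows up to 50 error-level entries from the last hour.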

Coder Server Logs:

# SSH into VM
gcloud compute ssh coder-entrypoint --project=coderd --zone=us-central1-a

# View logs
sudo journalctl -u coder -f

Resource Management¶

List Active Services:

# Cloud Run services
gcloud run services list --project=coderd

# Compute instances
gcloud compute instances list --project=coderd

# Backend services
gcloud compute backend-services list --project=coderd