Infrastructure

0. Quick Overview

If you are new, use this section before the detailed steps.

Scope in this guide:

Control Center -> Infrastructure pages only (Dashboard, Zones, Pods, Clusters, Hosts)

What you will do in this section:

Validate dashboard health.
Verify zone baseline, then configure Pod -> Cluster -> Host in that order.
Confirm host readiness, then continue to VM workflow.

New-user flow at a glance:

Login -> Dashboard check -> Zone -> Pod -> Cluster -> Host -> Final validation -> VM workflow

Tooltips and help drawer:

When available on a page, use the page help icon (top-right) when field labels are unclear.
Start with the Zones page help drawer for terms like Allocation State and Network Type.
Confirm each required field meaning before creating or editing infrastructure objects.

Critical warnings before you change anything:

Do not continue if dashboard cards are all zero (fix Bootstrap first).
Do not create overlapping pod subnets or invalid IP ranges.
Do not delete clusters with attached hosts or running/stopped VMs.
Do not proceed to workload onboarding while host state is Down, Disconnected, or Alert.

Quick troubleshooting entry points:

Dashboard issues: go to Dashboard Troubleshooting.
Zone issues: go to Zone Troubleshooting.
Pod issues: go to Pod Troubleshooting.
Cluster issues: go to Cluster Troubleshooting.
Host issues: go to Host Troubleshooting.

Common screen checks

Use these checks for read-only dashboard, overview, and details screens in Infrastructure before retrying a page action:

Confirm you are in the correct site, zone, pod, cluster, or host context.
Verify the parent object for the page exists, is healthy enough to load, and is visible to your role.
Open Observability and review recent Events/Alerts for the resource name and timestamp before retrying.

1. Overview

This section is the first Control Center workflow after ISO install, Bootstrap, and login.

Goal:

Understand Infrastructure pages
Configure core objects in correct order
Validate readiness before VM setup

Tip

Always follow this sequence: Zone -> Pod -> Cluster -> Host.

1.1. HA Proxy (Default System Behavior)

Note

For first-time infrastructure setup, focus on Zone -> Pod -> Cluster -> Host completion first. HA proxy is enabled by default and is not a user-configurable toggle in the UI.

What is HA Proxy:

HA proxy is the traffic-handling layer that helps distribute service requests and improve availability across backend services.

Why it matters:

It improves service continuity during host/service changes.
It reduces single-path dependency for service traffic handling.

Current behavior:

Traffic handling is more resilient
Service interruption risk is reduced during changes/failures
HA proxy is enabled by default for clusters.
Users cannot enable or disable HA proxy from cluster actions.

Production implications:

Validate failover behavior in a maintenance window before production cutover.
Keep topology and ownership documentation current for incident response.

Validation:

Cluster view shows HA proxy state as active/default.
No user HA toggle action is expected in cluster actions.

2. Prerequisites

Before starting, confirm:

Karios ISO install is complete
Bootstrap completed without critical errors
You can log in to Karios portal
Your role has access to Control Center

Role note:

Zone baseline is created by the platform administrator during Bootstrap. New operators verify zones and then create pods/clusters/hosts as permitted.

Pre-check table:

Check	Where to verify	Expected result
Login access	Portal login page	Successful login
Control Center access	Left navigation	`Dashboard`, `Zones`, `Pods`, `Clusters`, `Hosts` visible
Bootstrap state	Dashboard cards	Cards are not all zero

RBAC action map (minimum expected access):

Page	Needed action access	If missing
Dashboard	View infrastructure	Request dashboard visibility access from admin
Zones	View zone details	Request `Zone View` (and `Zone Manage` if you must edit/create)
Pods	Create/update pod	Request `Pod Manage` access
Clusters	Create/update cluster	Request `Cluster Manage` access
Hosts	Add/manage hosts	Request `Host Manage / Onboarding` access

Warning

If all cards are zero and no infrastructure objects exist, do not continue with pod/cluster/host creation. Fix Bootstrap first.

3. Infrastructure Basics (For New Users)

If this is your first time with infrastructure terms, use these simple meanings:

Term	What it means	Why it matters
Zone	Top-level boundary for a location or environment (for example, `us-east-prod`). A zone defines the main networking context and acts as the parent container for infrastructure objects.	Pods, clusters, and hosts are scoped under a zone. If the zone is wrong, all child resources are placed in the wrong environment.
Pod	Network grouping inside a zone. A pod defines host-side networking values such as gateway, netmask, and management IP allocation range.	Hosts consume IPs from pod ranges. Good pod design prevents IP conflicts and simplifies scale-out.
Cluster	Logical group of hosts sharing the same hypervisor type and management model. Clusters are where capacity and scheduling decisions are made.	VMs are placed on hosts through cluster capacity, so cluster sizing and health directly impact workload stability.
Host	Physical server with hypervisor and BMC management access. It contributes CPU, memory, storage connectivity, and runtime execution for VMs.	This is the actual compute layer. If hosts are down or misconfigured, clusters cannot run workloads.

Simple hierarchy:

Zone -> Pod -> Cluster -> Host -> Virtual Machine

Real-world analogy:

Zone: Building
Pod: Floor/network segment
Cluster: Room of servers
Host: One physical server in that room

How these topics connect:

You choose the Zone first (where resources belong).
You define Pod networking (how hosts get management addresses).
You create a Cluster (where hosts are grouped for scheduling).
You onboard Hosts (actual hardware capacity for VMs).

4. New User Flow

Use this first-time flow:

Login -> Control Center -> Dashboard check
-> Verify Zone exists
-> Create/verify Pod
-> Create/verify Cluster
-> Add/verify Host
-> Final validation -> move to VM workflow

Decision branch for first-time setup:

If no zone exists: verify Bootstrap from Control Center -> Dashboard and Bootstrap logs first. Then follow your operating model: delegated infra roles can create the baseline zone; admin-owned environments should request administrator creation.
If zone exists but disabled: request enablement before creating pods/clusters.

5. Control Center Dashboard

Open Dashboard Page

When to Use:

Use this at the start of an Infrastructure check or before making Infrastructure changes.

Purpose:

Open the Dashboard and verify site health, capacity, and active alerts from one page.
This baseline check helps you avoid changes during degraded conditions.

Path:

Control Center -> Dashboard

Steps:

Click the Control Center icon in the far-left rail.
Click Dashboard.
The Control Center dashboard opens on the infrastructure overview page.

Expected Outcome:

You confirm baseline infrastructure health, capacity headroom, and alert posture before moving to object-level actions.

What you see first:

Top summary cards (Zones, Pods, Clusters, Hosts, VMs, Services).
Overview capacity widgets for compute, storage, and network.

Estimated time: 1-2 minutes

Control Center dashboard with infrastructure cards and metrics

What this screenshot shows:

Control Center Dashboard with top summary cards for Zones, Pods, Clusters, Hosts, VMs, and Services
Overview section with Compute, Storage, and Network utilization widgets
Health widgets for Security Compliance, Performance Health, and Avg Latency

Action in this screen:

Verify summary-card counts match expected environment footprint
Verify compute, storage, and network utilization values are within normal range
Verify compliance/health/latency widgets do not show abnormal risk trends

If this fails:

See Common screen checks before retrying this screen.

5.1. Summary and Overview Components

Component	What it shows	What new users should do
`Zones / Pods / Clusters / Hosts` cards	Current infrastructure object counts.	Confirm baseline objects exist before continuing to Zones/Pods/Clusters/Hosts pages.
`VMs` card	Total VMs and running count.	Confirm workload activity matches expectation before maintenance changes.
`Services` card	Platform service count and current service health state.	If service health is not healthy, validate dependency health before making changes.
`Compute` widget	Memory, CPU, GPU, and CPU core usage vs total.	Confirm there is enough compute headroom for new workloads.
`Storage` widget	Instance storage used/provisioned, image storage, and cluster capacity usage.	Confirm storage growth is controlled and not near critical thresholds.
`Network` widget	Management/shared/public IP usage and guest network type distribution.	Confirm available IP capacity before adding hosts or deploying VMs.
`Security Compliance` widget	Compliance score and finding categories.	Track trend and escalate if score drops unexpectedly.
`Performance Health` widget	Overall platform health score from dependency checks.	Use as a go/no-go signal before onboarding more infrastructure objects.
`Avg Latency` widget	Latency trend graph with day-over-day change indicator.	Investigate sudden spikes before capacity or placement actions.

When to Use:

Use this as the continuation view of the same Dashboard page, after reviewing the first screenshot view.

Purpose:

The same Dashboard page also includes management server health, dependency status, IPAM/DCIM inventory, and recent alerts.

Steps:

Stay on Control Center -> Dashboard.
Scroll further down the same Dashboard page.
Review management/dependency/site/inventory/alert components.
Continue only when there are no blocking health signals.

Expected Outcome:

You complete a full dashboard readiness check across both screenshot views of the same page before moving to object-level actions.

Control Center dashboard continuation view with additional infrastructure widgets

What this screenshot shows:

Control Center Dashboard continued view (second screenshot) from the same single page
Additional dashboard cards/widgets used for infrastructure validation and follow-up checks

What you can find:

Management-server status and core platform dependency health
Site, connectivity, IPAM, and DCIM inventory context
Bare-metal discovery/provisioning status and latest infrastructure alerts

If this fails:

See Common screen checks before retrying this screen.

5.2. Site, Operations, and Alert Components

Component	What it shows	What new users should do
`Management Servers`	Active management node identity, role, IP, and OS.	Confirm management plane is up before performing infrastructure operations.
`Platform Dependencies`	Dependency status for services such as management server, MySQL, InfluxDB, PostgreSQL, and Ceph.	If any dependency is not `Up`, stop non-essential changes and triage first.
`Network & IPAM`	VRFs, prefixes, IP ranges, IP addresses, and VLAN inventory counts.	Validate address-management inventory before network or host expansion.
`Site & Connectivity`	Site identity, location, facility metadata, VLAN/node counts, and NTP endpoints.	Confirm site metadata and NTP values are correct for operational consistency.
`Forge — Bare Metal`	Bare-metal lifecycle counters (discovered, ready, provisioned, configured, in progress) by vendor flow.	Use this widget to track onboarding pipeline state before host assignment.
`DNS - DCIM`	DNS/domain details and DCIM inventory counters (devices, racks, sites, cables).	Validate inventory visibility and DNS baseline before scaling infrastructure.
`Alerts`	Latest infrastructure-impact alerts with timestamp and `View all` link.	Open `View all` and prioritize repeated/critical alert patterns before continuing.

5.3. How to Use the Dashboard

Open Control Center -> Dashboard.
Review summary cards and overview health widgets at the top.
Scroll on the same page to review management/dependency/site/inventory/alert widgets.
Continue to Zones only after dashboard checks are healthy.

5.4. Summary Cards (Click and Hover Behavior)

How to use summary cards:

Hover each card for a quick definition.
Click any card to open its resource list for detailed actions.
Use card counts as your first infrastructure health checkpoint before edits.

Clickable elements on this page:

Top summary card headings (Zones, Pods, Clusters, Hosts, VMs, Services) are clickable and route to their respective pages.
Component action labels such as View All (for example in alerts/compliance areas when shown) open the related full page.
On list pages (Zones, Pods, Clusters, Hosts), selecting a row opens the corresponding detail page.
When a widget or section title is rendered as a link in the UI, selecting it routes to that section’s full page.

5.4.1. Access Dashboard Help Page

When to Use:

Use this during dashboard review when a card/widget label or metric meaning is unclear.

Purpose:

Context help ensures you interpret dashboard health and capacity indicators correctly before taking action.

Navigation:

Control Center -> Dashboard

Steps:

Open the Dashboard page.
Click the page help icon in the top-right.
Search for the metric or field name you need.
Use inline info icons beside headings/labels for quick definitions.

Expected Outcome:

You can interpret dashboard metrics with consistent definitions before proceeding to infrastructure actions.

Quick card actions:

Zones: open zone boundaries and verify the target environment.
Pods: open pod networking and capacity grouping.
Clusters: open cluster readiness and placement context.
Hosts: open host health and available compute capacity.
VMs: open VM state and investigate total-vs-running gaps.
Services: open dependency health details in Observability -> Diagnostics before provisioning changes.

If this fails:

See Common screen checks before retrying this screen.

5.4.2. Summary Card Reference

When to Use:

Use this while reading top summary cards and deciding where to drill down next.

Purpose:

Provide a quick card-by-card mapping of meaning, click behavior, and most common user task.

Steps:

Read the card name and count.
Match it with the table below to confirm meaning and default action path.
Click the card to open the target page for detailed review.

Expected Outcome:

You can interpret each summary card correctly and navigate directly to the right operational page.

Card	What it means	Click action and common task
`Zones`	Top-level infrastructure boundaries (datacenter/availability domains).	Opens the Zones list. Use to validate zone readiness and allocation state.
`Pods`	Rack-level groupings inside a zone.	Opens the Pods list. Use to verify pod network ranges and capacity grouping.
`Clusters`	Host groups with shared hypervisor/storage behavior.	Opens the Clusters list. Use to validate HA, managed state, and placement.
`Hosts`	Physical compute servers available for workload placement.	Opens the Hosts list. Use to onboard hosts or review host health/utilization.
`VMs`	Total VM count and running VM count.	Opens `Compute -> Virtual Machines`. Use to investigate stopped/errored VMs.
`Services`	Platform dependency service count and overall health.	Opens `Observability -> Diagnostics`. Use to identify degraded/down dependencies and latency impact.

Recent alerts visibility:

The dashboard can show alerts as a dedicated card or as a Recent Alerts widget, depending on layout.
Use View all to open Observability -> Alerts and triage critical items first.

If this fails:

See Common screen checks before retrying this screen.

5.5. Dashboard Areas (What to Use and When)

The dashboard is one continuous page with multiple operational areas:

Dashboard Area	What it gives you	When to use it
Top summary and overview area	Summary cards, capacity widgets (compute/storage/network), and health widgets.	Use for daily readiness checks and pre-change validation.
Site and inventory area	Management server, site/connectivity metadata, IPAM, and DCIM inventory context.	Use before network, host, and capacity expansion work.
Dependency and alert area	Platform dependency health plus latest alert feed with drill-down action.	Use when health is degraded or error/incident signals are visible.

5.6. Capacity and Health Guidance

When to Use:

Use this during daily checks, before provisioning changes, and whenever dashboard health appears degraded.

Purpose:

Provide clear interpretation rules for capacity widgets and health indicators so decisions are consistent.

Steps:

Review display mode and color status for Compute, Storage, and Network.
Validate latency and performance health trends before making changes.
Check compliance and recent alerts, then proceed only when risk is acceptable.

Expected Outcome:

You make go/no-go infrastructure decisions using consistent thresholds and panel behavior.

Display controls used in capacity panels:

Donut view: quick utilization percentage.
List view: used/total breakdown in bar format.
Expand: full-screen review for detailed inspection.

Color coding used across capacity views:

Green (0-69%): healthy headroom.
Orange (70-89%): monitor and plan expansion.
Red (90%+): action required immediately.

Compute panel guidance:

Why: verify CPU, memory, GPU, and core headroom before new VM placement.
How: hover donuts to read used/available/total and utilization.
Outcome: clear go/no-go decision for compute scaling.

Storage panel guidance:

Why: validate Instance Storage and Image Storage growth trends before provisioning.
How: review used vs total; expand panel if trend context is needed.
Outcome: avoid deployment failures caused by insufficient storage capacity.

Network panel guidance:

Why: ensure private and public IP pools can support upcoming deployments.
How: review used and total IP counts in panel tooltips.
Outcome: prevent VM provisioning blocks caused by exhausted IP ranges.

Ceph storage metric (if available):

Use as raw cluster storage trend visibility.
Interpret it as raw capacity, not directly usable capacity after replication overhead.

Security compliance and health widgets:

Security Compliance: use compliance score, risk level, and findings to prioritize remediation.
Health: use overall status, service state, and uptime as operational readiness checks.
Recent Alerts: use as immediate triage queue; open View All for full context and follow-up.

Avg latency panel guidance:

< 50 ms: healthy response times.
50-100 ms: acceptable but monitor trend.
> 100 ms: elevated; check Observability -> Diagnostics for the slow dependency.
Red upward day-over-day change badge means latency increased vs yesterday and should be investigated if sustained.
Green downward badge means latency improved vs yesterday.

Performance health gauge guidance:

The gauge score reflects dependency readiness and latency posture (higher is healthier).
What these stats mean? opens Observability -> Diagnostics for per-dependency status, latency, and errors.
Use gauge drops as a stop signal for non-essential provisioning changes until dependency health stabilizes.

Forge bare metal panel guidance:

Vendor groups include SUPERMICRO, DELL iDRAC, and HPE iLO.
Lifecycle states:
- Discovered: detected, not yet ready
- Ready: inventoried and ready for provisioning
- Provisioned: OS/hypervisor installed
- Configured: fully configured for workload use
- In Progress: provisioning/configuration running
Use Manage to open bare-metal orchestration for deeper lifecycle actions.

If this fails:

See Common screen checks before retrying this screen.

5.7. Site and Inventory Validation

Use these dashboard site/inventory cards when you need environment correctness checks before infrastructure changes.

What to validate:

Location and Site Info: confirm correct site and environment context.
DNS Configuration: confirm domain and resolver endpoints are correct.
NTP Servers: confirm primary/secondary/tertiary time sources are configured.
Site Statistics: check node and capacity counts for expansion readiness.
IPAM: verify prefixes, ranges, and address inventory headroom.
DCIM: verify physical inventory visibility (devices, racks, cables, manufacturers).

Why this matters:

Correct site metadata avoids wrong-site operations.
Correct DNS/NTP avoids service discovery and timestamp consistency issues.
Correct IPAM/DCIM baseline reduces expansion and troubleshooting errors.

5.8. Platform Dependency Checks

Use the Platform Dependencies card for real-time control-plane health checks.

Summary bar interpretation:

Service Ready: overall readiness state (Ready/Partial/Unavailable).
Dependencies: total monitored dependency count.
Healthy: healthy dependency count vs total.
Avg Latency: average service response time.
Errors: observed error count in the active window.

Dependencies table outcome model:

UP with normal latency and low errors: healthy service path.
DEGRADED: service reachable but at risk; monitor and stabilize.
DOWN: service unavailable; investigate immediately and expect feature impact.

When to act immediately:

Service Ready is not Ready.
Any critical dependency is DOWN.
Latency and error trends spike together across dependencies.

5.9. Dashboard Warnings and Troubleshooting

Warning

Do not continue with provisioning or topology edits when dashboard health is degraded or critical alerts are active.

Problem	Likely cause	What to do
High alert count on dashboard	Active platform incidents	Open `Observability -> Alerts`, prioritize critical alerts, and stabilize first.
Capacity panel above 90%	Resource saturation	Add capacity or reduce load before onboarding additional workloads.
Platform Dependencies shows unhealthy state	One or more dependencies are degraded/down	Open the dependency list, identify affected services, then continue triage in `Observability -> Diagnostics`.
VMs count much higher than running count	Stopped or errored VM population	Open `Compute -> Virtual Machines` and review VM state before continuing.
IP-related deployment failures	IP pool exhaustion or range mismatch	Validate dashboard `Network & IPAM` capacity and update ranges per policy.
Time/certificate anomalies	NTP configuration drift	Validate dashboard site/connectivity NTP values and restore synchronized time sources.

Expected Outcome:

Cards are populated
Metrics are visible (after host/storage onboarding)

Do not proceed if:

Cards remain all zero
Dashboard shows persistent load/errors

Tip

Check this page daily for capacity trend and health at a glance.

6. Zones

Zones are the top-level infrastructure boundary. As a new user, think of a zone as the place where all downstream compute and network objects live. Start here to confirm you are in the correct environment before creating pods and clusters.

Zone key concepts:

Zone: top-level infrastructure boundary, aligned to a datacenter or availability domain.
Allocation State: Enabled allows new allocations; Disabled blocks new allocations while existing workloads continue.
Network Type: networking model for the zone (Advanced).
DNS: public and internal DNS settings used for resolution.
Guest CIDR: guest network address space reserved for VM networking.

Verify-only flow:

Verify that Bootstrap already created your target zone.
If no required zone exists, stop and request administrator action to create it.

Note

Zones are created during the initial site Bootstrap and reflect your datacenter’s physical layout. All pods, clusters, and hosts reside within a zone.

Open Zones Dashboard

When to Use:

Use this when validating zone readiness before pod/cluster work.

Purpose:

Zones define the top-level infrastructure boundary and allocation state for all downstream objects.

Path:

Control Center -> Zones

Steps:

Click the Control Center icon in the far-left rail.
Click Zones.
The Zones dashboard opens with the zone list and top summary widgets.

Expected Outcome:

You confirm the target zone exists, is enabled, and is ready for pod and cluster operations.

What you see first:

Zones table with zone name, status, and DNS/network columns.
Zone-level summary widgets above the table.

Estimated time: 2-3 minutes

Zones dashboard showing zone list and status summary

What this screenshot shows:

Zone list with columns for Status, Network Type, Primary DNS, and Secondary DNS
Top utilization summary rings for zone-level capacity/health context

Action in this screen:

Confirm zone exists and is Enabled
Open zone details for DNS/network validation

Help on this page (use before changes):

Click the page help icon in the top-right to open the help drawer.
Use the help search box to find terms like Allocation State and Network Type.
Use inline info icons beside labels to confirm field meaning.

If zone is missing:

Do not create a zone from this workflow.
Raise an administrator request to complete zone Bootstrap for the target site.
Continue only after the required zone appears and is enabled.

Zones table columns:

Column	Description
Zone Name	Display name of the zone. Select the row to open zone details.
Status	Allocation state of the zone (enabled/disabled for new allocations).
Network Type	Zone networking model.
Primary DNS	Primary public DNS resolver.
Secondary DNS	Secondary public DNS resolver.

If this fails:

See Common screen checks before retrying this screen.

6.1. Access Zone Help Page

When to Use:

Use this before editing zone-related settings or when you need field definitions.

Purpose:

The help panel provides immediate, page-specific definitions so you can validate terminology before taking action.

Navigation:

Control Center -> Zones

Steps:

Open the Zones dashboard.
Click the page help icon in the top-right.
In the help drawer, search for field names such as Allocation State or Network Type.
Apply the guidance, then continue with validation or actions on the same page.

Zones page with Help Center quick reference drawer open

What this screenshot shows:

Built-in page help drawer with field-by-field definitions (for example allocation state and network type)
Search box to quickly find documentation for current page terms

Expected Outcome:

You confirm field meaning and avoid incorrect configuration decisions.

If this fails:

See Common screen checks before retrying this screen.

6.2. Access Zone Details Page

When to Use:

Use this before creating pods/clusters or when validating zone readiness.

Purpose:

Zone details confirms allocation state, DNS/domain settings, and guest CIDR scope.
Validating this first prevents downstream network and placement issues.

Navigation:

Control Center -> Zones

Steps:

Open the Zones list page.
Click the target zone row (Zone Name) to open its details page.
Review Basic Information and Network Configuration cards.
Confirm Allocation State is enabled and DNS/guest-network values match your plan.

Expected Outcome:

You confirm the selected zone is safe for pod/cluster onboarding.
You can catch zone-level misconfiguration before provisioning actions.

Zone details page with basic information, network configuration, and pod list

What this screenshot shows:

Zone detail header with enabled state
Basic Information and Network Configuration cards (domain, guest CIDR, DNS, internal DNS)
Embedded pod list with gateway and attached clusters/hosts counts

What you can find:

Zone state, network type, DNS/domain, and overall readiness

Zone detail cards and fields:

Basic Information:

Field	Description
Name	Human-readable zone name.
ID	System identifier used for API and internal references.
Allocation State	Controls whether new resources can be provisioned in this zone.
Network Type	Networking model used by this zone.
Zone Token	Internal authentication token used for secure zone communication.
Description / Display Text	Optional descriptive labels if configured.

DNS Configuration:

Field	Description
Domain / Domain ID / Domain Name	Zone domain identity used for name resolution context.
DNS 1 / DNS 2	Primary and fallback public DNS resolvers.
Internal DNS	Internal resolver used for management-domain name lookups.

Network Configuration:

Field	Description
Guest CIDR Address	Address space reserved for guest VM network allocation.
Routed Mode Enabled	Indicates routed networking mode where available.
Router Private/Public Interface Max MTU	Interface MTU values where exposed by your environment.

Note

Some zone fields are deployment-dependent. If a field is not shown in your UI, treat it as not exposed in your current environment.

Expected Outcome:

At least one enabled zone is ready for pod placement

Do not proceed if:

No usable zone exists
Zone remains disabled

Zone checklist:

Item	Validation
Status	`Enabled`
DNS	Primary DNS configured
Scope	Correct zone selected for your environment

Next step:

Go to Control Center -> Pods and verify/create the pod for this zone.

If this fails:

See Common screen checks before retrying this screen.

7. Pods

Pods define host management networking inside a zone. This is where you decide gateway, subnet, and host IP range. Correct pod values are required before onboarding hosts, because hosts consume these addresses.

Pod key concepts:

Zone: each pod belongs to one zone.
Clusters: pods contain one or more clusters.
Instance Storage: workload storage attached at pod/cluster scope.
Image Storage: used for VM templates, Boot Images, and snapshots.

Pod operational tips:

Pods are created during initial zone setup.
Modifying a pod network configuration can impact all clusters and hosts inside that pod. Plan and validate changes carefully before applying.
If a pod allocation state is Disabled, new VMs and additional resources cannot be provisioned in clusters under that pod.

Create vs verify:

Verify existing pod settings first.
Create a new pod only when you need a new network segment or no usable pod exists.

Open Pods Dashboard

When to Use:

Use this when validating existing pod network ranges or preparing to add a new pod.

Purpose:

Pod network settings (gateway, netmask, IP range) directly impact host onboarding and cluster growth.

Path:

Control Center -> Pods

Steps:

Click the Control Center icon in the far-left rail.
Click Pods.
The Pods dashboard opens with the pod list and create action.

Expected Outcome:

You can verify pod-to-zone mapping and network-range readiness before cluster and host actions.

What you see first:

Pod table with zone, gateway, netmask, and IP range columns.
Create Pod action in the top-right.

Estimated time: 3-5 minutes

Pods dashboard showing pod list and network range columns

What this screenshot shows:

Pod list with Zone, Gateway, Netmask, Start IP, and End IP
Create Pod action and top-level capacity indicators

Action in this screen:

Review existing pods
Click Create Pod to add a new pod

If this fails:

See Common screen checks before retrying this screen.

Create Pod (When Required)

When to Use:

Perform this action when no existing pod satisfies the required management subnet and IP range for the target zone.

Purpose:

Create a pod when you need a new management network segment or when no existing pod matches your zone/network plan.

Navigation:

Control Center -> Pods

Steps:

On the Pods dashboard, click Create Pod.
In the Create Pod drawer, select Zone.
Enter Pod Name.
Enter Gateway and Netmask.
Enter Start IP and End IP.
Submit with Create Pod.

Create Pod side panel showing required network fields

What this screenshot shows:

Create Pod drawer with required fields: zone, pod name, gateway, netmask, start IP, and end IP.
Form submit action for creating the pod after validation.

Expected Outcome:

The pod appears in the list under the selected zone.
Pod details show the configured gateway/netmask/IP range.
If allocation state is enabled, the pod is ready for cluster and host onboarding.

How to verify:

Confirm the new pod row appears in Control Center -> Pods.
Open pod details and verify Zone, Gateway, Netmask, Start IP, and End IP match submitted values.
Confirm Allocation State is enabled.

If unavailable:

If Create Pod is not visible, verify role permissions and zone scope.
If the button is disabled, correct mandatory field validation errors and retry.
If creation continues to fail, use Pod Troubleshooting.

If this fails:

See Common screen checks before retrying this screen.

7.1. Access Pod Help Page

When to Use:

Use this before submitting pod create/edit actions when field intent is unclear.

Purpose:

The help panel and info icons provide page-specific definitions that reduce network-value input errors.

Navigation:

Control Center -> Pods

Steps:

Open the Pods dashboard.
Click the page help icon in the top-right.
Search for field names such as Gateway, Netmask, or Allocation State.
Use inline info icons beside labels for quick field confirmation.

Expected Outcome:

You validate pod field expectations before submission and reduce avoidable validation failures.

Pods table columns:

Column	Description
Name	Pod display name.
Allocation State	Pod enablement state for new allocations.
Zone	Parent zone of this pod.
Gateway	Pod network gateway.
Netmask	Pod subnet mask.

Create Pod field map:

Field	What to enter
`Zone`	Target zone where hosts will be onboarded.
`Pod Name`	Unique pod identifier aligned with your naming standard.
`Gateway`	Pod gateway IP for the selected subnet.
`Netmask`	Netmask matching the gateway and host allocation range.
`Start IP`	First host allocation IP in subnet (exclude gateway).
`End IP`	Last host allocation IP in subnet (must be greater than `Start IP`).

Create Pod form behavior (UI):

The form opens from Create Pod and requires all mandatory fields before submission.
Zone selection drives pod placement; validate zone first to avoid rework.
If validation fails, correct subnet/range inputs before retrying submission.

Example values (reference):

Gateway: 192.168.117.253
Netmask: 255.255.255.0
Start IP: 192.168.117.20
End IP: 192.168.117.100

Validation of example:

Start and end are in same subnet as gateway
Range excludes gateway address
Enough free IPs remain for growth

If this fails:

See Common screen checks before retrying this screen.

7.2. Access Pod Details Page

When to Use:

Use this after pod creation or when validating existing pod network values.

Purpose:

Pod details confirms the zone mapping, allocation state, and final gateway/netmask/IP range values.

Navigation:

Control Center -> Pods

Steps:

Open the Pods dashboard.
Click the target pod row (Name) to open pod details.
Review General Information and Network Configuration.
Confirm zone mapping and network range values are correct.

Expected Outcome:

You validate pod readiness before cluster creation and host onboarding.

Pod details page with general information, network configuration, and cluster-host mapping

What this screenshot shows:

Pod detail header and allocation state
General Information and Network Configuration cards with finalized pod values
Attached Clusters and Hosts tables for downstream dependency validation

What you can find:

Pod-to-zone mapping
Pod gateway/subnet and host allocation range

Pod detail cards and fields:

General Information:

Field	Description
Pod Name	Pod identifier used in the infrastructure hierarchy.
Zone / Zone ID	Parent zone for this pod.
Allocation State	Controls whether new allocations can be made in the pod.

Network Configuration:

Field	Description
Gateway	Gateway address for the pod network.
Netmask	Subnet mask for pod addressing.
Start IP / End IP	Pod allocation IP range for infrastructure resources.

Viewing and pagination:

Click any pod row to open pod details.
Use table pagination controls to navigate large pod lists.

Expected Outcome:

Pod appears in list
Pod state is enabled
Pod details match entered network values

Do not proceed if:

Pod create fails due to invalid network values
Pod state is not enabled after creation

Warning

Do not use overlapping subnets/IP ranges. Ensure Start IP < End IP and both are in the selected subnet.

Next step:

Go to Control Center -> Clusters and verify/create the cluster in this pod.

If this fails:

See Common screen checks before retrying this screen.

8. Clusters

Clusters organize hosts into a workload-capable group. You select management type and hypervisor here. If you plan multiple environments, create separate clusters (for example, production vs testing) instead of mixing workloads in one cluster.

HA proxy note:

HA proxy helps distribute and maintain service traffic across available backend services.
HA proxy is enabled by default to improve resilience during node/service changes.
HA state is visible in cluster details, and HA actions can be available from the cluster actions menu based on role/policy.

Create vs verify:

Verify existing cluster health and mapping first.
Create a new cluster only when you need isolation, extra capacity, or a different workload boundary.

Open Clusters Dashboard

When to Use:

Use this when checking cluster health/placement or preparing to create a new cluster.

Purpose:

Cluster state determines whether hosts can be safely onboarded and whether workloads can be placed as expected.

Path:

Control Center -> Clusters

Steps:

Click the Control Center icon in the far-left rail.
Click Clusters.
The Clusters dashboard opens with cluster status and action controls.

Expected Outcome:

You confirm cluster mapping, status, and available actions before making cluster-level changes.

What you see first:

Cluster table with type, hypervisor, zone, pod, and health columns.
Create Cluster action and row-level Actions menu.

Estimated time: 3-5 minutes

Clusters dashboard showing cluster list with HA and managed state columns

What this screenshot shows:

Cluster table with Type, Hypervisor, Zone, Pod, Status, HA Enabled, and Managed State
Create Cluster action button and row-level actions menu

Action in this screen:

Review existing clusters
Click Create Cluster to open the create panel
Use row Actions for available cluster operations such as delete (subject to policy/state)

If this fails:

See Common screen checks before retrying this screen.

Cluster Actions

When to Use:

Use this when you need to run cluster-level operations from the list view.

Purpose:

Row actions are the fastest operational entry point for cluster controls without opening full details first.

Steps:

Open Control Center -> Clusters.
Find the target cluster row.
Open the row Actions menu.
Select the required action (for example HA toggle or delete, based on policy/state).

Expected Outcome:

You access cluster operations directly from the dashboard list for faster day-to-day management.

Cluster actions are opened from the row Actions menu in the Clusters table. Some environments also expose equivalent actions in the cluster details header.

Cluster actions menu with cluster operations in row actions

What this screenshot shows:

Actions dropdown opened from a cluster row
Cluster-level operations available from row actions (for example HA toggle and delete, based on policy/state)

Button	Description
Disable HA	Disables high availability for the selected cluster. Use only during planned maintenance or explicit operational exceptions. Result: cluster stays active, but HA protection is no longer applied until re-enabled.
Delete Cluster	Permanently removes the cluster and associated cluster object metadata. Result: the cluster is removed from the list and cannot be restored from the UI.

Disable HA (when available):

When to Use:

Perform only during approved maintenance/change windows when temporary removal of HA protection is required.

Purpose:

Temporarily disable HA protection only when a planned operation requires it and rollback is defined.

Navigation:

Control Center -> Clusters -> row Actions (⋮)

Steps:

Open the target cluster row Actions menu.
Click Disable HA.
Confirm the action when prompted.

Expected Outcome:

Cluster remains active, but automatic HA failover protection is disabled until re-enabled.

How to verify:

Open cluster details and confirm HA Enabled shows disabled/off.

If unavailable:

The action can be hidden by role, policy, or current cluster state.
Use approved change-control paths if direct action is blocked.

Delete Cluster (when required):

When to Use:

Perform only after all workloads, hosts, and dependencies are removed and the cluster is no longer needed.

Purpose:

Permanently remove an unused cluster object after dependency cleanup to avoid stale infrastructure state.

Navigation:

Control Center -> Clusters -> row Actions (⋮)

Steps:

Validate all delete safety checks listed below.
Open the target cluster row Actions menu.
Click Delete Cluster.
Confirm deletion in the dialog.

Expected Outcome:

The cluster object is permanently removed from the list.

How to verify:

Refresh Control Center -> Clusters and confirm the deleted cluster row is no longer present.

If unavailable:

Deletion is blocked while dependencies still exist or policy prevents delete.
Remove dependencies first, then retry.

Optional actions by environment:

View Details: open full cluster detail page.
Edit: modify supported cluster settings.
Scale: adjust associated cluster resources where supported.

Note

HA is enabled by default. If Disable HA is available in your environment, use it only with approved change control and clear rollback planning.

Warning

Action buttons can be disabled while the cluster is in a transitional state.

Delete safety checks (must pass before delete):

No hosts are attached to the cluster.
No running or stopped VMs are mapped to hosts in this cluster.
No active dependency (placement, policy, or operational runbook) still references this cluster.
A replacement cluster/path exists if workloads must continue.

If these checks are not met, do not delete. Correct dependencies first.

If this fails:

See Common screen checks before retrying this screen.

Create Cluster (When Required)

When to Use:

Perform this action when no existing cluster meets the required zone/pod mapping or workload isolation requirement.

Purpose:

Create a cluster when you need a new workload boundary, separate lifecycle domain, or additional host capacity grouping.

Navigation:

Control Center -> Clusters

Steps:

On the Clusters dashboard, click Create Cluster.
In the Create Cluster drawer, select Zone.
Select Pod.
Enter Cluster Name.
Select Cluster Type and Hypervisor when those fields are shown.
Submit with Create Cluster.

Create cluster side panel with zone, pod, and cluster name fields

What this screenshot shows:

Right-side Create Cluster drawer
Required Zone, Pod, and Cluster Name inputs
Create Cluster submit button at the bottom

Expected Outcome:

The cluster appears in the list with the selected zone/pod mapping.
Cluster details become available for validation and host onboarding.
A healthy cluster state confirms readiness for adding hosts.

How to verify:

Confirm the new cluster row appears in Control Center -> Clusters with correct Zone and Pod values.
Open cluster details and verify cluster status is healthy/Up.
Confirm the cluster is available for host onboarding.

If unavailable:

If Create Cluster is not visible, verify role permissions.
If Pod is unavailable in the form, verify an enabled pod exists in the selected zone.
If creation fails, use Cluster Troubleshooting.

If this fails:

See Common screen checks before retrying this screen.

8.1. Access Cluster Help Page

When to Use:

Use this before cluster create/action updates when you need quick definitions for state, policy, or placement fields.

Purpose:

Context help reduces incorrect zone/pod mapping and action misuse in cluster operations.

Navigation:

Control Center -> Clusters

Steps:

Open the Clusters dashboard.
Click the page help icon in the top-right.
Search for terms such as HA Enabled, Managed State, or Allocation State.
Use inline info icons beside headings/labels where shown.

Expected Outcome:

You confirm cluster field behavior before create/edit/action operations.

Clusters table columns:

Column	Description
Cluster Name	Cluster identifier. Select to open cluster details.
Type	Cluster management/orchestration model.
Hypervisor	Hypervisor technology used by hosts in the cluster.
Zone	Parent zone where the cluster is deployed.
Pod	Parent pod associated with the cluster.
Status	Cluster health state (for example, `Up` or `Down`).
HA Enabled	High availability state for the cluster.
Managed State	Cluster management mode (managed/unmanaged).
Actions	Row-level action menu for cluster operations.

Status indicators:

Up: cluster healthy and available.
Down: connectivity/resource issue requiring investigation.
HA Enabled = Yes: failover protection active.
HA Enabled = No: no HA protection for failure scenarios.

Tip

If Pod is empty, re-check that the selected zone has at least one enabled pod and that your RBAC scope includes that pod.

Warning

Do not proceed with cluster creation if zone/pod mapping is uncertain. Incorrect mapping will cause host onboarding and VM placement issues in downstream infrastructure steps.

If this fails:

See Common screen checks before retrying this screen.

8.2. Cluster Detail (Single Cluster View)

The Cluster Detail page gives a complete view of one cluster. Use it to inspect identity/configuration, monitor cluster metrics, confirm HA status visibility, and perform cluster-level actions.

When to Use:

Use this after cluster creation, during health checks, or before host/workload changes.

Purpose:

Cluster details provide the authoritative view for placement mapping, HA state, overcommit settings, and attached host readiness.

Path:

Control Center -> Clusters -> <cluster name>

Steps:

Open Control Center -> Clusters.
Select the target cluster row or use View Details from the row Actions menu.
The cluster details page opens for that specific cluster.

Expected Outcome:

You validate cluster readiness and configuration before taking downstream host or workload actions.

What you see first:

Cluster summary cards and configuration details for the selected cluster.
Section-level actions and detail areas (identity, configuration, metrics, and hosts).

Cluster details page showing general information, configuration, HA, and host list

What this screenshot shows:

Cluster type, hypervisor, and zone/pod placement details
Cluster status/readiness context for host onboarding
HA state, overcommit settings, and DRS imbalance visibility on the detail page

If this fails:

See Common screen checks before retrying this screen.

8.2.1. Detail Cards Overview

The detail page is organized into four primary information areas plus a hosts table for attached capacity.

First 60-second new-user check:

Confirm Zone and Pod are the intended target.
Confirm State and Allocation State are enabled for workload placement.
Confirm Hosts count is non-zero before planning workload onboarding.
Confirm HA Enabled is shown as active/default.
Scan DRS imbalance for obvious distribution issues before adding new VMs.

General Information

Displays cluster identification and location details.

Field	Description
Cluster Name	Unique cluster identifier.
Zone	Geographic/logical zone where the cluster is deployed. Click to open zone details.
Pod	Pod inside the selected zone. Click to open pod details.
Cluster Type	Management type (for example `CloudManaged`) indicating orchestration model.
Hypervisor	Underlying virtualization technology (for example KVM, VMware, Hyper-V).
Allocation State	Resource allocation state (`Enabled`/`Disabled`). Controls whether new workloads can be scheduled.

Tips:

Use Zone and Pod links to validate placement in the infrastructure hierarchy.
Hypervisor type determines VM compatibility and operational behavior.
Set allocation state to disabled during planned maintenance when you need to block new placements.

Configuration

Displays cluster configuration and resource management settings.

Field	Description
Managed State	Cluster management mode (for example `Managed` or `Unmanaged`).
CPU Overcommit Ratio	Allowed CPU overcommit level. Higher values can increase VM density per physical core.
Memory Overcommit Ratio	Allowed memory overcommit level. Higher values can increase VM density per available memory.

Tips:

Overcommit ratios directly influence workload density and contention risk.
Higher ratios can improve utilization but can reduce performance under heavy load.
Tune overcommit values based on workload profile and SLA targets.
Re-check cluster performance after any overcommit ratio change.

Starter baseline (new environments):

Production-critical workloads: start conservative (for example, CPU 1.0-2.0 and Memory 1.0) and increase only after performance validation.
General mixed workloads: start moderate (for example, CPU 2.0-4.0 and Memory 1.0-1.5), then tune using observed utilization.
Test/dev workloads: higher overcommit can be acceptable if performance variability is tolerated.

Warning

Baselines are starting points only. Always align final values with SLA, host capacity, and real workload telemetry.

Other Metrics

Displays additional operational and balancing indicators.

Field	Description
Hosts	Number of physical hosts in the cluster (shown as `current / total`).
DRS Imbalance Percentage	Distribution imbalance percentage indicating how evenly VMs are spread across hosts.
State	Overall cluster operational state (for example `Enabled` or `Disabled`).

Tips:

Monitor host count to understand current capacity and redundancy.
Lower DRS imbalance indicates better workload distribution.
Persistent high imbalance can indicate consolidation pressure or scheduling skew.
Use these metrics for migration planning and capacity optimization.

High Availability

Displays HA protection state and readiness.

Field	Description
HA Enabled	Shows HA proxy status. This is expected to remain enabled by default.

Tips:

HA is enabled by default in new environments.
Some environments expose HA controls in the cluster actions menu; others keep HA policy locked by role.
Confirm redundant infrastructure (hosts, storage, network paths) for effective failover behavior.
Test failover behavior regularly for critical workloads.
Verify workloads can restart on healthy nodes as expected after planned failover testing.

8.2.2. General Cluster Detail Tips

Use breadcrumb navigation to return to the Clusters dashboard quickly.
Treat card values as read-first operational telemetry and use action/edit flows for changes.
Review detail metrics regularly to catch capacity constraints early.
Reassess overcommit settings periodically based on observed workload behavior.
Use cluster detail data to plan host additions and VM migration windows safely.

Expected Outcome:

Cluster appears in list with correct mapping
Cluster is healthy and ready for hosts
HA proxy state is visible as active/default in cluster details

Do not proceed if:

Cluster remains unhealthy after creation
Cluster mapping (zone/pod/hypervisor) is incorrect

Tip

Use separate clusters for production and non-production workloads.

Next step:

Go to Control Center -> Hosts and add/verify hosts for this cluster.

9. Hosts

Hosts are physical servers added into clusters. This is the final infrastructure step before VM creation. Each host must be reachable through BMC and discovered successfully by Karios to contribute capacity.

Create vs verify:

Verify existing hosts if current capacity is sufficient.
Add hosts only when scaling, replacing hardware, or onboarding a new cluster.

Open Hosts Dashboard

When to Use:

Use this when validating host readiness or before onboarding additional hardware.

Purpose:

Hosts are the final infrastructure layer before VM operations, and host health directly impacts workload stability.

Path:

Control Center -> Hosts

Steps:

Click the Control Center icon in the far-left rail.
Click Hosts.
The Hosts dashboard opens with host health/capacity columns and onboarding actions.

Expected Outcome:

You confirm host availability, utilization, and onboarding entry points before creating or migrating workloads.

What you see first:

Hosts table with state, CPU, and memory columns.
Add Host action and cluster filter controls.

Estimated time: 5-10 minutes per host

Hosts dashboard showing host health and capacity columns

What this screenshot shows:

Host table with IP Address, State, CPU Cores, Memory, CPU Used, and Memory Used
All clusters filter and Add Host button in the top-right
The example host row is in Up state (workload-ready context)

If this fails:

See Common screen checks before retrying this screen.

Host Row Actions Entry Point

When to Use:

Use this when you need host-specific operations from the hosts list.

Purpose:

Row actions provide direct access to operational controls (enable/disable/maintenance/reconnect) without leaving the dashboard.

Steps:

Open Control Center -> Hosts.
Find the target host row.
Open the row Actions menu.
Select the required host operation.

Expected Outcome:

You trigger host-level operations quickly from the list view.

Hosts dashboard row actions menu for host operations

What this screenshot shows:

Row-level Actions menu used for host-specific operations.
Operational entry point for per-host management tasks.

If this fails:

See Common screen checks before retrying this screen.

Filter Hosts by Cluster

When to Use:

Use this when you need to isolate hosts for a specific cluster during onboarding, troubleshooting, or maintenance.

Purpose:

Filtering reduces operator error by limiting actions to the intended cluster scope.

Steps:

Open Control Center -> Hosts.
Open the All clusters filter.
Select the required cluster name.
Review the filtered host list before taking actions.

Expected Outcome:

You work on the correct host set for the selected cluster.

Hosts dashboard cluster search/filter control

What this screenshot shows:

Cluster search/filter control for narrowing the hosts list by cluster.
Quick way to isolate hosts for one cluster during onboarding and troubleshooting.

Host table columns:

Column	Description
Host Name	Unique host identifier. Select to open host details.
IP Address	Management-reachable host IP.
State	Operational host state.
CPU Cores	Total physical CPU cores available.
Memory	Total host memory capacity.
CPU Used	Current CPU utilization percentage.
Memory Used	Current memory utilization percentage.

State meanings:

Up: host is healthy and workload-ready.
Down: host is offline or unreachable.
Disabled: host is intentionally disabled for new allocations.

Warning

Successful onboarding requires host state Up. Treat Alert, Down, or Disconnected as blocking states and resolve them before VM provisioning.

Action in this screen:

Review host health and capacity
Click Add Host to open the onboarding form

If this fails:

See Common screen checks before retrying this screen.

Add Host (When Required)

When to Use:

Perform this action when a cluster needs additional capacity, hardware replacement, or first-time compute onboarding.

Purpose:

Add a host when you need more compute capacity, are replacing hardware, or onboarding infrastructure into a new or existing cluster.

Navigation:

Control Center -> Hosts

Steps:

On the Hosts dashboard, click Add Host.
Enter location and hardware identity fields.
Enter BMC IP, username, password, and vendor.
Continue to Review and validate inputs.
Submit the host onboarding request.

Add Host side panel with details step and BMC credential fields

What this screenshot shows:

Stepper-based host onboarding (Details then Review)
Required site/location/rack/manufacturer/device position inputs
BMC credential fields used for hardware discovery
Continue button to move into review before final submit

Expected Outcome:

The host record appears in the hosts list.
Discovery populates hardware and hypervisor details.
Host state transitions to Up when connectivity and validation checks pass.

How to verify:

Confirm the host row appears in Control Center -> Hosts.
Open host details and verify CPU/memory/hypervisor values are populated.
Confirm host State transitions to Up.

If unavailable:

If Add Host is not visible, verify role permissions.
If submission fails, validate BMC IP reachability and credentials.
If onboarding remains unsuccessful, use Host Troubleshooting.

If this fails:

See Common screen checks before retrying this screen.

9.1. Access Host Help Page

When to Use:

Use this before onboarding/editing hosts when you need clarification for discovery and credential fields.

Purpose:

Help definitions reduce onboarding failures caused by incorrect site/location/BMC inputs.

Navigation:

Control Center -> Hosts

Steps:

Open the Hosts dashboard.
Click the page help icon in the top-right.
Search for fields such as BMC IP, State, or Cluster.
Use inline info icons beside labels to verify expected values and format.

Expected Outcome:

You confirm host-input requirements before submission and reduce onboarding retries.

Add host steps (detailed input guide):

Enter location data (site/location/rack/position)
Enter hardware/manufacturer details
Enter BMC data:
- BMC IP
- BMC username
- BMC password
- BMC vendor
Enter optional device name if required by your internal naming policy
Click Continue and review all values
Submit the host from the Review step

Expected Behavior

Typical host onboarding timeline:

0:00 - Host record is created in the list
0:30-2:00 - Discovery populates hardware details (CPU/RAM/hypervisor)
2:00-5:00 - State transitions to Up when connectivity and checks pass

If host remains Down or Disconnected beyond 5 minutes, verify BMC IP reachability and credentials first.

If this fails:

See Common screen checks before retrying this screen.

9.2. Access Host Details Page

When to Use:

Use this after host onboarding or during host-health troubleshooting.

Purpose:

Host details confirms discovered hardware, hypervisor, and OS metadata needed for workload placement decisions.

Navigation:

Control Center -> Hosts

Steps:

Open the Hosts dashboard.
Click the target host row (Host Name) to open host details.
Review identification, hypervisor, CPU/memory, and OS cards.
Confirm host state and discovered values are valid.

Expected Outcome:

You confirm the host is ready for workload placement and ongoing operations.

Host detail page with identification, hypervisor, CPU, memory, and OS cards

What this screenshot shows:

Host detail cards for general identification, hypervisor details, CPU/memory allocation, and OS metadata.
Direct validation view for onboarding completeness and host readiness.

What you can find:

Host state, role/type, hypervisor, CPU/memory visibility

Common host tasks:

Open host details for configuration and telemetry checks.
Review CPU and memory utilization before placing additional workloads.
Investigate non-Up states before VM provisioning.

If this fails:

See Common screen checks before retrying this screen.

9.3. Access Host Alerts Tab

When to Use:

Use this when host-level warnings or health events must be reviewed.

Purpose:

Alerts identify host conditions that can block placement or require immediate remediation.

Navigation:

Control Center -> Hosts -> <host name> -> Alerts

Steps:

Open the host details page.
Select the Alerts tab.
Review active alerts, severity, and timestamps.

Expected Outcome:

You identify host issues and can prioritize remediation before workload changes.

Host details Alerts tab showing host-related alert entries

If this fails:

See Common screen checks before retrying this screen.

9.4. Access Host Network Tab

When to Use:

Use this to validate network interfaces and host network connectivity details.

Purpose:

Network tab checks help prevent VM connectivity issues and host communication failures.

Navigation:

Control Center -> Hosts -> <host name> -> Network

Steps:

Open host details.
Select Network.
Review interface/state details and expected network values.

Expected Outcome:

You confirm host network readiness for VM operations.

Host details Network tab showing host network information

If this fails:

See Common screen checks before retrying this screen.

9.5. Access Host GPU Tab

When to Use:

Use this when planning or validating GPU-capable workloads.

Purpose:

GPU visibility confirms accelerator inventory and availability per host.

Navigation:

Control Center -> Hosts -> <host name> -> GPUs

Steps:

Open host details.
Select GPUs.
Verify detected GPU devices and current assignment state.

Expected Outcome:

You confirm whether the host can support GPU-backed VM workloads.

Host details GPUs tab showing available GPU devices

If this fails:

See Common screen checks before retrying this screen.

9.6. Access Host Console Tab

When to Use:

Use this during deeper troubleshooting when direct host console access is required.

Purpose:

Console access supports low-level diagnostics when UI-level checks are insufficient.

Navigation:

Control Center -> Hosts -> <host name> -> Console

Steps:

Open host details.
Select Console.
Launch and use console access per your operational policy.

Expected Outcome:

You can perform direct host inspection/troubleshooting where permitted.

Host details Console tab with host console access option

If this fails:

See Common screen checks before retrying this screen.

9.7. Access Host Metrics Tab

When to Use:

Use this to inspect host metrics before scaling or maintenance actions.

Purpose:

Metrics help confirm host load profile and detect resource pressure early.

Navigation:

Control Center -> Hosts -> <host name> -> Metrics

Steps:

Open host details.
Select Metrics.
Review host utilization metrics.

Expected Outcome:

You get host-level metrics context for capacity and incident decisions.

Note

A dedicated Metrics tab screenshot will be available in the next release.

If this fails:

See Common screen checks before retrying this screen.

9.8. Access Host Karios Shield Tab

When to Use:

Use this for host security posture review and security-related follow-up checks.

Purpose:

Karios Shield data provides host-level security insight needed for risk-based operations.

Navigation:

Control Center -> Hosts -> <host name> -> Karios Shield

Steps:

Open host details.
Select Karios Shield.
Review host security status/findings.

Expected Outcome:

You validate security status before maintenance or workload placement changes.

Host details Karios Shield tab showing host security information

If this fails:

See Common screen checks before retrying this screen.

9.9. Access Host Storage Tabs

When to Use:

Use this to inspect host-local storage state and dataset/pool context.

Purpose:

Storage-tab checks help confirm storage readiness and identify capacity/layout issues.

Navigation:

Control Center -> Hosts -> <host name> -> Storage

Steps:

Open host details.
Select Storage.
Review Overview, Pools, and Datasets tabs.

Expected Outcome:

You validate storage configuration and capacity context at host level.

If this fails:

See Common screen checks before retrying this screen.

9.10. Access Host VM Details Tab

When to Use:

Use this to verify which virtual machines are currently placed on a host.

Purpose:

VM details on the host page help validate placement, investigate host load, and identify impacted VMs during host incidents.

Navigation:

Control Center -> Hosts -> <host name> -> VMs

Steps:

Open Control Center -> Hosts.
Open the target host details page.
Click the VMs details tab.
Review VM name/state information for that host.

Expected Outcome:

You can confirm host-to-VM mapping and use it for troubleshooting or maintenance planning.
For newly onboarded hosts, an empty VMs tab is expected until VMs are deployed or migrated to that host.

Host details page VMs tab showing VM list and states for the selected host

What this screenshot shows:

Host details VMs tab with VM entries associated to the selected host
Direct view of VM placement context used for host-level operations
This screenshot is a populated example; your host can show no VM rows, which is normal for a new host.

Empty state behavior:

If the host is newly added and you have not deployed/migrated VMs yet, No virtual machines found (or an empty list) is expected behavior.
This is not a host onboarding failure by itself.

If you expected VMs but the list is empty:

Confirm VMs were created in Compute -> Virtual Machines.
Confirm placement/migration targeted this host’s cluster.
Refresh host details and re-check the VMs tab.

Expected Outcome:

Host appears in list
Host state moves to Up
Capacity details are discovered

Host ready-for-workload checklist:

Host is mapped to correct zone/pod/cluster
Hypervisor is detected correctly
CPU and memory values are visible
Host state is Up and not in maintenance/disabled state
Parent cluster status is healthy

Do not proceed if:

Host state remains Down or Disconnected
Host state is Alert
Hypervisor/capacity values are missing

Warning

Incorrect BMC IP/credentials is the most common host onboarding failure.

Next step:

Run the final validation checklist in Section 10, then continue to Compute -> Virtual Machines.

If this fails:

See Common screen checks before retrying this screen.

10. Final Validation

Run this checklist before moving to the VM workflow:

Dashboard cards visible and not all zero
Zone exists and enabled
Pod exists under correct zone
Cluster exists under correct pod
Host is added and healthy (Up)
Zone, pod, cluster, and host details pages open without errors

Warning

If any checklist item fails, stop here and use the troubleshooting sections below before moving to VM provisioning.

If all pass, Infrastructure is ready.

Success path summary:

Dashboard healthy
Zone enabled
Pod enabled with valid IP range
Cluster healthy with correct mapping
Host Up with visible capacity

When all of the above are true, proceed to the Compute -> Virtual Machines section.

11. Safe Correction (If You Created Something in the Wrong Place)

If you created objects in the wrong scope:

Stop and confirm what is wrong (zone, pod, cluster, or host mapping).
Do not proceed to VM deployment until corrected.
For pod/cluster mistakes, create the correct object first, then remove the incorrect one.
For host mistakes, reassign or re-onboard host into the correct cluster path.
Re-run the final validation checklist before continuing.

Warning

Delete actions should be done only after confirming no dependent workloads are attached. Never delete a cluster that still has attached hosts or running VMs.

12. Troubleshooting

Common issues:

Issue	Likely cause	Fix
Dashboard all zeros	Bootstrap incomplete/failed	Check Bootstrap logs, verify services, rerun setup if needed
Cannot create pod	Invalid subnet/range or zone not selected	Recheck gateway/netmask/IP range and zone selection
Cannot create cluster	Zone/pod dependency not met	Select zone first, then pod; verify both enabled
Host remains down	BMC unreachable or bad credentials	Ping BMC IP, verify login, check power/network, retry onboarding
Host in `Alert` state	Partial onboarding, hardware/service fault, or degraded management connectivity	Open host details, review alerts/logs, correct root cause, and confirm host returns to `Up`

12.1. Dashboard Troubleshooting

Issue: Dashboard cards show all zeros

When to Use:

Use this when Dashboard cards show all zeros is observed during infrastructure operations.

Purpose:

Identify the likely cause for this issue and restore the affected infrastructure object to a healthy state.

Steps:

Confirm you are logged into the correct environment/tenant.
Refresh the page and re-check cards.
Verify Bootstrap completed successfully.
Confirm at least one zone and pod exist.
Check services/logs with administrator if still zero.

Issue: Metrics not visible

Steps:

Confirm at least one host is onboarded and healthy.
Confirm storage and core services are online.
Wait a few minutes and refresh.
If still blank, escalate with screenshot and timestamp.

Expected Outcome:

The action completes successfully and the expected state is visible in the UI.

If this fails:

See Common screen checks before retrying this screen.

12.2. Zone Troubleshooting

Issue: Zone not visible

When to Use:

Use this when Zone not visible is observed during infrastructure operations.

Purpose:

Identify the likely cause for this issue and restore the affected infrastructure object to a healthy state.

Steps:

Check your RBAC permissions.
Clear zone filters/search.
Verify zone exists in the target environment.
Ask admin to verify domain/account scope.

Issue: Zone exists but disabled

Steps:

Open zone details and confirm state.
Check recent change logs/audit events.
Re-enable zone (if permitted) or request admin action.
Re-validate zone readiness before creating pods.

Expected Outcome:

The action completes successfully and the expected state is visible in the UI.

If this fails:

See Common screen checks before retrying this screen.

12.3. Pod Troubleshooting

Pod subnet overlap reference:

Example	Status	Why
Existing pod `10.10.0.0/24` and new pod `10.10.1.0/24`	Valid	Separate CIDRs, no overlap
Existing pod `10.10.0.0/24` and new pod `10.10.0.128/25`	Invalid	New subnet is inside existing range
Existing pod `172.16.0.0/16` and new pod `172.16.10.0/24`	Invalid	New range overlaps parent CIDR

Issue: Pod creation fails

When to Use:

Use this when Pod creation fails is observed during infrastructure operations.

Purpose:

Identify the likely cause for this issue and restore the affected infrastructure object to a healthy state.

Steps:

Confirm zone is selected first.
Validate gateway and netmask are correct.
Validate start/end IP are in subnet and Start < End.
Check for overlap with existing pod ranges.
Retry create operation.

Issue: Pod created but unusable

Steps:

Open pod details and verify network values.
Confirm pod state is enabled.
Confirm cluster creation uses the same zone/pod.
Fix network plan and recreate pod if values are wrong.

Expected Outcome:

The action completes successfully and the expected state is visible in the UI.

If this fails:

See Common screen checks before retrying this screen.

12.4. Cluster Troubleshooting

Issue: Pod dropdown not available in cluster form

When to Use:

Use this when Pod dropdown not available in cluster form is observed during infrastructure operations.

Purpose:

Identify the likely cause for this issue and restore the affected infrastructure object to a healthy state.

Steps:

Select zone first.
Verify selected zone has at least one enabled pod.
Confirm RBAC visibility for that pod.

Issue: Cluster unhealthy after creation

Steps:

Verify cluster mapping (zone/pod) is correct.
Verify hypervisor selection matches host capabilities.
Confirm at least one healthy host is present/added.
Re-check status after host onboarding.

Expected Outcome:

The action completes successfully and the expected state is visible in the UI.

If this fails:

See Common screen checks before retrying this screen.

12.5. Host Troubleshooting

Issue: Add host fails

When to Use:

Use this when Add host fails is observed during infrastructure operations.

Purpose:

Identify the likely cause for this issue and restore the affected infrastructure object to a healthy state.

Steps:

Verify BMC IP is reachable (ping/network path).
Verify BMC username/password by logging into BMC directly.
Verify BMC vendor selection matches hardware.
Verify server is powered and connected to management network.
Retry add host flow.

Issue: Host remains Down

Steps:

Check physical power and network link.
Check BMC remote console for boot/hardware errors.
Confirm hypervisor services are running.
Review host and management logs with administrator.
Remove/re-add host only after root cause is corrected.

Expected Outcome:

The action completes successfully and the expected state is visible in the UI.

If this fails:

See Common screen checks before retrying this screen.

13. Support

Support contact: support@karios.com

When opening a ticket, include:

Username/account
Affected area (Dashboard/Zone/Pod/Cluster/Host)
Exact error text
Screenshot
Steps already tried
Operational impact

Suggested ticket subject:

Infrastructure | <Component> | <Issue> | <ObjectName>
Example: Infrastructure | Host | Down | dc1-compute-03

→ Next: User Management

Expected Outcome

After this section, you should be able to:

explain zone/pod/cluster/host hierarchy
navigate Control Center infrastructure modules
perform core infrastructure actions safely

What To Do Next

Confirm at least one host is Up with visible CPU/RAM capacity.
Verify the Zone -> Pod -> Cluster mapping matches your deployment plan.
Continue to Compute - Virtual Machines for first workload provisioning.