1. Scope the Problem
Is the issue affecting one VM, one network, one node, or the entire system? Scoping determines which diagnostic tool to start with.
This page compiles the most common issues encountered by VergeOS administrators, organized by subsystem. Each section includes symptoms, root causes, and step-by-step resolution procedures.
Network connectivity problems are the most common support topic. Before diving in, verify whether other VMs in the same environment can reach the internet. If none can, the issue is likely upstream of VergeOS (switch, firewall, ISP). If other VMs work fine, the problem is almost always a configuration miss on the affected VM.
Symptom: VM boots but has no network interfaces visible in the guest OS.
Resolution:
Symptom: VM has a NIC but cannot reach other VMs or the internet.
Resolution:
Symptom: Windows VM shows no network adapter in Device Manager, even though a NIC is configured in VergeOS.
Resolution:
Symptom: NIC is present and drivers are installed, but the VM still cannot reach the network.
Resolution:
Administrators migrating from VMware or Nutanix often notice that VergeOS reports higher memory usage than they expect. This is by design — not a problem.
Symptom: VergeOS shows a VM using 8 GB of RAM, but the guest OS task manager shows only 2 GB in use.
Explanation: VergeOS displays allocated memory — the physical RAM reserved on the host for that VM. When you assign 8 GB to a VM, the hypervisor immediately reserves 8 GB of physical memory, regardless of what the guest is actively consuming. This is the real resource commitment on the host.
Unlike VMware (which uses balloon drivers to reclaim unused guest memory) or Nutanix AHV (which supports dynamic memory management), VergeOS intentionally does not use memory ballooning. This design choice provides:
| Metric | Where to Check | What It Means |
|---|---|---|
| VM Allocated RAM | VM Dashboard | Physical RAM reserved for this VM |
| Guest Active RAM | Inside guest OS (Task Manager / free -h) | What the guest is actually using |
| Node Available RAM | Node Dashboard → Memory | How much host RAM remains unallocated |
| Cluster Target Max RAM % | System → Settings → Advanced | Threshold for VM placement decisions |
Some server hardware generates repetitive, benign IPMI log entries that fill the System Event Log (SEL) and trigger unnecessary alerts. The most common offender is the “Get SEL Info command failed” message.
The System Event Log is stored in hardware (on the BMC/IPMI controller) with limited capacity. Once full, new events cannot be recorded until the log is cleared. The node dashboard shows SEL capacity as a percentage bar.
To suppress false-positive messages without losing real hardware alerts:
{ "key": "syslog_regex_list", "value": "2E2A4765742053454C20496E666F20636F6D6D616E64206661696C65642E", "default_value": "", "description": "Hex encoded lines of regular expressions to filter out of syslog"}The value is a hex-encoded regex: .*Get SEL Info command failed. — you can encode additional patterns at a hex encoding tool and separate multiple patterns with |.
Example — filtering two patterns:
The regex (Get SEL Info command failed|Unable to send command: Device or resource busy) encodes to:
284765742053454C20496E666F20636F6D6D616E64206661696C65647C556E61626C6520746F2073656E6420636F6D6D616E643A20446576696365206F72207265736F75726365206275737929After applying the filter, restart log capture on each affected node:
Option A — Via UI:
Option B — Via SSH:
sudo systemctl restart openipmiIf the SEL is already full:
Symptom: Windows 10/11 clients cannot access CIFS shares, receiving “access denied” or “cannot connect” errors even with correct credentials.
Root Cause: Modern Windows defaults to disabling insecure guest logons for SMB connections.
Resolution — Enable Insecure Guest Logons:
Win + R, type gpedit.msc, press EnterSymptom: macOS Finder cannot connect to CIFS shares, connections drop intermittently, or performance is unusable.
Resolution — Force SMB3 via nsmb.conf:
sudo nano /etc/nsmb.conf[default]smb_neg=smb3_onlysigning_required=norm -rf ~/Library/Caches/com.apple.finderkillall FinderSamba Fruit Module: For full macOS compatibility (Spotlight indexing, Time Machine support, resource forks), enable the Samba fruit VFS module in the NAS volume advanced settings. This enables native Apple SMB extensions.
Symptom: Users receive “Access Denied” when browsing or opening files on a share, even though they can see the share name.
Resolution checklist:
Symptom: File transfers over CIFS are significantly slower than expected.
Resolution:
Symptom: Node fails to boot from the VergeOS USB installer.
Resolution:
Symptom: Installation completes but the node cannot communicate with other nodes or the network.
Resolution:
Symptom: VergeOS installer does not detect all expected drives.
Resolution:
Symptom: Secondary controller or compute node fails to join the existing cluster.
Resolution:
Symptom: Dashboard shows a vSAN tier in “degraded” or “not redundant” status.
Explanation: A degraded state means one or more drives in a tier have failed or are unavailable, but the vSAN is still operational. Data remains accessible because VergeOS maintains redundancy across nodes.
Resolution:
Understanding expectations: Rebuild times depend on the amount of data on the tier and the I/O capacity of the remaining drives. During a rebuild:
Symptom: Dashboard alerts warn about storage capacity approaching limits.
Resolution:
| Utilization Level | Action Required |
|---|---|
| < 70% | Normal operation — no action needed |
| 70–85% | Plan capacity expansion; review snapshot retention policies |
| 85–90% | Actively reduce usage or add capacity |
| > 90% | Critical — prioritize expansion; risk of write failures |
When facing an issue that doesn’t fit the categories above, follow this general workflow:
1. Scope the Problem
Is the issue affecting one VM, one network, one node, or the entire system? Scoping determines which diagnostic tool to start with.
2. Use Component Diagnostics
Start with the component-specific diagnostic tool (Network, Node, NAS, or vSAN Diagnostics) — they run in the correct context automatically.
3. Check System Logs
Review Dashboard logs and system alerts for correlated events. Look for patterns — did multiple alerts fire at the same time?
4. Escalate with Data
If the issue is not resolved, generate a System Diagnostics bundle (System → System Diagnostics) and submit it with your support request.