vSphere 7

Log Disk Exhaustion on VCSA 7 Fix

Log Disk Exhaustion on VCSA 7 Fix. A look at troubleshooting and resolving issues with disk space on the log drive of vCenter Server VCSA

Keeping a “happy” vCenter Server in your vSphere environment is crucial to having a vSphere environment that operates correctly and smoothly. ย Space issues are definitely an issue that can cause problems. ย I wanted to shed light on a problem I have seen a couple of times in my home lab environment with myย VCSA 7. ย After logging into the vSphere Client, ย I have seen the messageย Log Disk Exhaustion on VCSA. ย What is the error? ย In this blog, I will walk you through triaging and resolving this issue with disk space on your vCenter Server VCSA appliance, so we can take a look at Log Disk Exhaustion on VCSA 7 Fix. ย ย 

Log Disk Exhaustion on VCSA 7 Update 2 Error

Have you seen this in your environment before with your VCSA appliance? You will probably see many posts, articles, and other resources in this case out there with others having the same issue. In the original blog post for the lab environment, I was running VCSA 7 Update 2.  Note the message in the screenshot below. You will see this error below. Note, this is not telling us we have an issue on a vSphere datastore (storage volume) or vSAN volume, but rather on a partition physically attached to vCenter Server (vCenter appliance).

By default, the error messages begin when disks are 80% full. When you see this, it means you need to give attention to space as it can certainly affect the performance of your VCSA appliance.

Log disk exhaustion on vcsa error message
Log disk exhaustion on vcsa error message

A bit of Googling led to the following VMware KB. You can find that support KB here: vCenter Server Appliance disk space is full (76563) (vmware.com). The official VMware Knowledge base article describes the steps to find the root cause of the issue with storage utilization in the vCenter Server in your VMware virtualization environment and begin troubleshooting. 

  • Partitions are not sized correctly
  • Large files in the partitions
  • Tables in the database consuming space
  • Services are failing to clean up files

For my VCSA appliance, I was using the standard, out-of-the-box partition sizes that come with a fresh deployment. ย I don’t really think I am seeing an issue with the services cleaning up log space as I think the logs are simply accumulating and becoming large enough before being pruned that it is flagging the warning in the VCSA vSphere Client. ย However, I wanted to clean up the space for the environment.

Steps to perform before troubleshootingย 

I definitely recommend that you have good backups of your VCSA appliance before you begin performing any steps below for disk usage maintenance on your vCenter host. What are the best ways to backup your VCSA appliance?

  1. One of the first things I do is take a snapshot of the VCSA appliance from the ESXi host, not from vCenter Server. So login to the host that has ownership of your VCSA virtual machine and take a quick snapshot.
  2. You can also have a file-level backup of your vCenter Server. You can take this from the VAMI management interface
  3. Have a full virtual machine backup using something like Veeam, NAKIVO, BDRSuite, etc. In a break glass situation, you can restore the full VM back to the cluster and be good to go.

Log Disk Exhaustion on VCSA 7 Fix and Solutions

To find where the disk space is being used, you can start by establishing an SSH connection to your VCSA appliance and use theย df -hย Linux command to verify which partition is having an issue.ย  On a Windows machine, you can use Putty. As suspected with the error message, the pathย /storage/logย partition is the one having issues. However, each environment may have different partitions that are causing an issue. The methods and tips below though can apply to any of the vCenter partition hard disk configurations.ย 

Looking at storage space used on the vcsa appliance
Looking at storage space used on the vcsa appliance

Next, to start looking at where the storage space is being used specifically in the /storage/log partition, you can access the shell and issue the command from the shell and press enter:

du -a /storage/log | sort -n -r | head -n 20
Finding the largest directories in the storage log partition
Finding the largest directories in the storage log partition

By looking at the directory structure listed above by size, I drilled down into theย /storage/log/vmware/eam/webย partition and found quite a few .log files that were 8 MB a piece for the ESX Agent Manager logging directory. ย After removing most if not all, I was able to reclaim a couple of gigs worth of space.

Vmware vsphere esx agent manager log files in storage log partition
Vmware vsphere esx agent manager log files in storage log partition

Next directory up for scrutiny is theย /storage/log/vmware/lookupsvcย directory listed in the output earlier. ย In my environment, below is a screenshot of the log files that I had accumulated under the directory. ย While not sizeable in and of themselves, combined with the sheer number, these were taking a good chunk of disk space also. ย  ย 

Cleaning up the storage log vmware lookupsvc directory
Cleaning up the storage log vmware lookupsvc directory

Under the same directory another level deep is theย tomcatย directory. ย Under the tomcat directory, I had another set of fairly numerous log files that had accumulated here. ย Again, I axed the ones that were older and kept just the most recent. ย This resulted in even more space gained back. ย 

The lookupsvc tomcat directory files
The lookupsvc tomcat directory files

After going through manually pruning the files listed in the directories above, I was able to reclaim quite a bit of space and now had over 4 gigs free compared to the under 1 gig free that I had when I started the exercise. ย 

After performing cleanup on the log files on the storage log partition
After performing cleanup on the log files on the storage log partition

VCSA Log Disk Full Video

Video showing concepts to find log files taking up space in your VCSA appliance

Increase the size of a VCSA hard disk

There is a supported way to increase the size of the partitions on your VCSA. Note the following steps:

To identify the logical volume mapping to a partition, execute the following command:

lsblk -o NAME,HCTL,MOUNTPOINT,SIZE

For obtaining the detailed PCI device paths for each block device, use:

ls -al /sys/block/sd*

To pinpoint the partition requiring expansion, take, for example, /storage/lvm_snapshot which corresponds to the logical volume /sys/block/sdq. Ascertain the SCSI label for the related PCI device by executing:

cat /sys/devices/pci0000:00/0000:00:11.0/0000:02:00.0/label

Should the device path identified in the previous step lack a serial number format preceding the host directory (at the fourth directory level), it signifies mapping to SCSI0. For instance, /sys/devices/pci0000:00/0000:00:10.0/ is associated with SCSI0.

Understanding that the partition /sys/block/sdq aligns with SCSI device 1 and Unit number 1, derived from the third segment of HCTL from the initial step’s output, it is mapped as SCSI(1:1).

To change disk space through the BASH Shell, ensure no snapshot is attached to the vCenter Server VM, then:

SSH into the vCenter Server Appliance using root credentials.

Activate the Bash shell with:

shell.set --enabled true

Enter shell and press Enter.

Identify the disk with capacity issues by executing:

df -h; lsblk; lsscsi

If, for instance, /storage/log correlates with sde and its SCSI ID is SCSI(0:4), this information will be mirrored in the vsphere client’s virtual machine settings as the disk under SCSI(0:4).

To expand the disk space of the vCenter Server Appliance virtual machine, use the vSphere Client or vSphere Web Client.

Properties of vcsa appliance
Properties of vcsa appliance

Following the disk expansion, execute this script within the SSH session to automatically enlarge any logical volumes with extended physical volumes:

/usr/lib/applmgmt/support/scripts/autogrow.sh

Verify the successful disk enlargement with:

df -h

For disk space augmentation via the Appliance Management CLI:

SSH into the vCenter Server Appliance with root credentials.

Identify the disk facing capacity constraints. For vCenter Server Appliance versions:

6.5 and 6.7, execute:

com.vmware.appliance.version1.resources.storage.stats.list

Post disk expansion, to automatically resize logical volumes on the extended physical volumes for vCenter Server Appliance versions 6.5 and 6.7, run:

com.vmware.appliance.system.storage.resize

Lastly, re-run the storage stats list command for confirmation:

com.vmware.appliance.version1.resources.storage.stats.list

Monitoring

You definitely want to monitor your VMware vCenter Server with tools that can shed light on disk space issues when the capacity on your partitions is becoming exhausted or you reach a certain threshold on your disk resource. Be sure to setup an alert you can trigger in this scenario when disk usage has grown. You can use vRealize Operations (now VMware Aria Operations) or other tools like VMware Skyline. Plus, you get the benefit of other automation from these tools.

Wrapping up VCSA log disk exhaustion fix

If you get the message in the vSphere Client, Log Disk Exhaustion on VCSA, like me you are probably wondering what is going on with the log directory. Hopefully, the content here will help you to pinpoint the issue with vCenter and if you need to expand disk size in the VM settings. The Log Disk Exhaustion onย VCSA 7ย Fix for me was going through the directories listed in the output of the “du -a” command and finding directories there that had many log files contained in the directories. I didn’t really find a silver bullet 1 or two files that were generated and causing the disk space issues.

For me, it was just a combination of log files in the storage/log partition that was the culprit. Hopefully, in addition to the VMware KB article referenced in the post, this will help any who want to get past this error and issue on their VCSA appliance. Let me know if you have seen this issue in the comments below. Or, do you use Proxmox, Hyper-V, XCP-ng, Nutanix, or another solution? I would be curious to know that also. If you have any questions or suggestions let me know in a comment or the VHT forums.

Subscribe to VirtualizationHowto via Email ๐Ÿ””

Enter your email address to subscribe to this blog and receive notifications of new posts by email.



Brandon Lee

Brandon Lee is the Senior Writer, Engineer and owner at Virtualizationhowto.com, and a 7-time VMware vExpert, with over two decades of experience in Information Technology. Having worked for numerous Fortune 500 companies as well as in various industries, He has extensive experience in various IT segments and is a strong advocate for open source technologies. Brandon holds many industry certifications, loves the outdoors and spending time with family. Also, he goes through the effort of testing and troubleshooting issues, so you don't have to.

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.