
NCP-AII Exam Dumps - NVIDIA AI Infrastructure

Looking for a workable way to pass the NVIDIA NCP-AII exam? You're in the right place. ExamCert offers realistic, trusted exam prep tools to help you earn the credential. ExamCert's NCP-AII PDF Study Guide, Testing Engine and Exam Dumps provide relevant, up-to-date study material in an easy-to-learn question-and-answer format. The study tools aim to simplify the exam's complex concepts and let you practice under realistic exam conditions with the testing engine and exam dumps.

Question # 9

One of the nodes in a cluster is not running as fast as the others and the system administrator needs to check the status of the GPUs on that system. What command should be used?

A.

lspci | grep NVIDIA

B.

nvidia-smi

C.

nvidia-gpu-status

D.

iblinkinfo
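
For context on the tooling named in the options: `nvidia-smi` can print per-GPU fields as CSV through its `--query-gpu` and `--format` options, which makes it easier to compare a slow node against its peers. The sketch below only parses such CSV text. The helper name `parse_gpu_csv` is hypothetical and the sample output is invented for illustration, not captured from a real system.

```python
import csv
import io

# Fields requested from nvidia-smi, in the order they appear in each CSV row.
FIELDS = ["index", "name", "temperature.gpu", "utilization.gpu", "memory.used"]

# Command that would produce matching output on a node with the NVIDIA driver.
QUERY_CMD = [
    "nvidia-smi",
    "--query-gpu=" + ",".join(FIELDS),
    "--format=csv,noheader,nounits",
]

def parse_gpu_csv(text):
    """Turn nvidia-smi CSV output (no header, no units) into a list of dicts."""
    rows = []
    for record in csv.reader(io.StringIO(text), skipinitialspace=True):
        if not record:
            continue  # skip blank lines
        rows.append(dict(zip(FIELDS, (cell.strip() for cell in record))))
    return rows

# Invented sample output for two GPUs (assumption, for illustration only).
SAMPLE = (
    "0, NVIDIA H100 80GB HBM3, 34, 0, 512\n"
    "1, NVIDIA H100 80GB HBM3, 71, 98, 40960\n"
)

gpus = parse_gpu_csv(SAMPLE)
hottest = max(gpus, key=lambda g: int(g["temperature.gpu"]))
print("hottest GPU index:", hottest["index"])
```

On a real node, the text passed to `parse_gpu_csv` would come from running `QUERY_CMD`, for example with `subprocess.run(QUERY_CMD, capture_output=True, text=True)`.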

Question # 10

During a DGX cluster deployment, what is the most effective way to verify the health and integrity of the local RAID storage array?

A.

Run a read/write benchmark utility, such as FIO, across the RAID array, looking for expected speed and latency metrics as proof of storage integrity.

B.

Verify that all configured RAID volumes are mounted and available in the operating system, and that disk utilization levels are within recommended limits.

C.

Use the mdadm --examine and mdadm --detail commands to review the RAID array’s status, checking for drive failures, array consistency, and error events.
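
As background on the `mdadm --detail` command mentioned in option C: its report is a list of `Key : Value` lines, including `State`, `Active Devices` and `Failed Devices`. The sketch below parses that layout and applies a deliberately simplified health check. The function names are hypothetical, the sample report is invented, and a real check would likely look at more fields than these two.

```python
def parse_mdadm_detail(text):
    """Collect the 'Key : Value' lines of an mdadm --detail report into a dict."""
    info = {}
    for line in text.splitlines():
        if " : " in line:
            key, value = line.split(" : ", 1)
            info[key.strip()] = value.strip()
    return info

def looks_healthy(info):
    """Simplified check: no 'degraded' flag in State and zero failed devices."""
    state = info.get("State", "")
    failed = int(info.get("Failed Devices", "0"))
    return "degraded" not in state and failed == 0

# Invented sample reports (assumption, not taken from a real array).
CLEAN = """/dev/md0:
        Raid Level : raid1
             State : clean
    Active Devices : 2
    Failed Devices : 0
"""
DEGRADED = """/dev/md0:
        Raid Level : raid1
             State : clean, degraded
    Active Devices : 1
    Failed Devices : 1
"""

for name, report in (("clean", CLEAN), ("degraded", DEGRADED)):
    print(name, looks_healthy(parse_mdadm_detail(report)))
```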

Question # 11

What is the purpose of using NCCL in verifying East-West fabric in an NVIDIA AI Factory?

Pick the 2 correct responses below.

A.

To measure the storage network performance.

B.

To measure the latency between GPUs.

C.

To measure the power consumption of GPUs.

D.

To measure bandwidth between GPUs.
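
As background on how bandwidth figures from NCCL tests are usually read: the nccl-tests suite reports an algorithm bandwidth (message size divided by elapsed time) and a bus bandwidth, which for all-reduce scales the algorithm bandwidth by 2(n-1)/n for n ranks. A minimal sketch of that arithmetic follows. The function names are hypothetical and the input numbers are invented for illustration.

```python
def algbw_gbps(size_bytes, time_s):
    """Algorithm bandwidth in GB/s: bytes moved divided by elapsed time."""
    return size_bytes / time_s / 1e9

def allreduce_busbw_gbps(size_bytes, time_s, n_ranks):
    """Bus bandwidth for all-reduce: algbw scaled by 2*(n-1)/n."""
    return algbw_gbps(size_bytes, time_s) * 2 * (n_ranks - 1) / n_ranks

# Invented measurement: 8 GB reduced across 8 GPUs in 62.5 ms.
size, elapsed, ranks = 8e9, 0.0625, 8
print("algbw GB/s:", algbw_gbps(size, elapsed))
print("busbw GB/s:", allreduce_busbw_gbps(size, elapsed, ranks))
```

The scaling factor makes the bus bandwidth comparable to the hardware link speed regardless of how many ranks take part, which is why it is the figure usually compared against fabric specifications.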

Question # 12

Which of the following steps are essential components of a recommended DGX cluster installation procedure?

Pick the 2 correct responses below.

A.

Group nodes by function during initial setup and assign them to relevant categories in the cluster management tool.

B.

Configure networking by validating all interfaces on each node, ensuring proper InfiniBand and Ethernet connectivity prior to installing cluster software.

C.

Install Slurm on the head node and then configure the compute nodes’ default OS images.

D.

Complete application containerization, run distributed jobs, and skip validation of node health or storage availability.

Question # 13

An engineer needs to validate NVLink Switch functionality on a DGX H100 system with 8 GPUs. Which NCCL command verifies intra-node NVLink bandwidth?

A.

broadcast_perf -b 8 -e 16G -f 2 -g 8 without split configuration

B.

all_reduce_perf -b 8 -e 16G -f 2 -g 4 with NCCL_TESTS_SPLIT="MOD 2"

C.

all_reduce_perf -b 8 -e 16G -f 2 -g 1 repeated 8 times

D.

all_reduce_perf -b 8 -e 16G -f 2 -g 8 with NCCL_TESTS_SPLIT="OR 0x7"
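
All four options share the flags `-b 8 -e 16G -f 2`, which in nccl-tests set the minimum message size, the maximum message size and the multiplication factor between steps. The sketch below enumerates the sizes such a sweep would cover. It assumes that the `G` suffix means 1024^3 bytes and that the maximum size is included in the sweep; the function names are hypothetical.

```python
# Binary size suffixes (assumption: K/M/G are powers of 1024).
UNITS = {"K": 1024, "M": 1024**2, "G": 1024**3}

def parse_size(text):
    """Convert a size such as '8' or '16G' to a byte count."""
    text = text.strip().upper()
    if text[-1] in UNITS:
        return int(text[:-1]) * UNITS[text[-1]]
    return int(text)

def sweep_sizes(min_bytes, max_bytes, factor):
    """List message sizes from min to max (inclusive), multiplying by factor."""
    if factor < 2:
        raise ValueError("factor must be at least 2 for a geometric sweep")
    sizes = []
    size = min_bytes
    while size <= max_bytes:
        sizes.append(size)
        size *= factor
    return sizes

sizes = sweep_sizes(parse_size("8"), parse_size("16G"), 2)
print(len(sizes), "sizes from", sizes[0], "to", sizes[-1], "bytes")
```

Under these assumptions the sweep runs from 8 bytes (2^3) to 16 GiB (2^34) in powers of two, so small messages expose latency and large messages expose peak bandwidth.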

Question # 14

An enterprise IT team has completed the physical installation of an AI Factory with a Spectrum-X Ethernet network connected to all GPU servers. They now need to ensure the environment is ready for scalable AI workload deployment. What is the recommended sequence of validation steps?

A.

Set up Active Directory and LDAP, configure role-based access controls and security settings first, install users, and skip network or hardware performance validation.

B.

Perform application benchmarking first, use performance logs to identify bottlenecks, update switch and server firmware afterward, and then tune the network using performance tests.

C.

Validate the software stack, test link connectivity and port health, run network benchmarks, run OSPF, ensure neighbors are exchanging route information, then stage AI workload tests.

D.

Confirm switch and server firmware configuration, test link connectivity and port health, run network benchmarks, validate the software stack, then stage AI workload tests.

Question # 15

An engineer must ensure that a BlueField-3 NIC firmware download matches the cluster’s PSID. Which step is critical before installation?

A.

Check that the DPU’s BMC IP is reachable by ping.

B.

Confirm that the firmware file size matches the DPU’s flash capacity.

C.

Use mstflint -d <PCI_ID> query to validate the device PSID before selecting the firmware image.
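
As background on the query mentioned in option C: the output of an `mstflint` query includes a `PSID:` line identifying the board configuration the firmware must match. The sketch below extracts that value from query text and compares it with the PSID a firmware image is built for. The function names are hypothetical, and the sample output and PSID values are invented placeholders, not real identifiers.

```python
def extract_psid(query_output):
    """Return the value of the 'PSID:' line, or None if it is absent."""
    for line in query_output.splitlines():
        if line.strip().startswith("PSID:"):
            return line.split(":", 1)[1].strip()
    return None

def psid_matches(query_output, image_psid):
    """True only when the device reports a PSID equal to the image's PSID."""
    device_psid = extract_psid(query_output)
    return device_psid is not None and device_psid == image_psid

# Invented sample query output (assumption, placeholder values only).
SAMPLE_QUERY = """Image type:            FS4
FW Version:            00.00.0000
PSID:                  MT_0000000000
"""

print(extract_psid(SAMPLE_QUERY))
print(psid_matches(SAMPLE_QUERY, "MT_0000000000"))
print(psid_matches(SAMPLE_QUERY, "MT_1111111111"))
```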

D.

Verify that the SHA256 hash of the firmware matches NVIDIA’s public ledger.

Question # 16

A system engineer needs to set the vGPU scheduling behavior for all GPUs to share the scheduling equally with the default time slice length. What command should be used?

A.

esxcli system module parameters set -m nvidia -p "NVreg_RegistryDwords=RmPVMRL=0x01"

B.

esxcli graphics module parameters set -m nvidia -p "NVreg_RegistryDwords=RmPVMRL=0x01"

C.

esxcli system module parameters set -m nvidia -p "NVreg_RegistryDwords=FRL=0x01"

D.

esxcli system module parameters set -m nvidia -p "NVreg_RegistryDwords=RmPVMRL=0x00"
