The Quantum Exact Simulation Toolkit v4.0.0
Loading...
Searching...
No Matches
Synchronisation

Functions for copying memory between a Qureg's CPU (RAM) and GPU (VRAM) memory. More...

Functions

void syncQuregFromGpu (Qureg qureg)
 
void syncQuregToGpu (Qureg qureg)
 
void syncSubQuregFromGpu (Qureg qureg, qindex localStartInd, qindex numLocalAmps)
 
void syncSubQuregToGpu (Qureg qureg, qindex localStartInd, qindex numLocalAmps)
 

Detailed Description

Functions for copying memory between a Qureg's CPU (RAM) and GPU (VRAM) memory.

These functions are only necessary when the user wishes to manually probe or modify the Qureg amplitudes (rather than use functions like getQuregAmps() and setQuregAmps()), to ensure that the CPU and GPU copies of the Qureg are identical. These functions have no effect when running without GPU-acceleration, but remain legal and harmless to call, to achieve platform agnosticism.

Function Documentation

◆ syncQuregFromGpu()

void syncQuregFromGpu ( Qureg qureg)
Note
Documentation for this function or struct is under construction!
Warning
This function has not yet been unit tested and may contain bugs. Please use with caution!

Definition at line 397 of file qureg.cpp.

397 {
398 validate_quregFields(qureg, __func__);
399
400 // permit this to be called even in non-GPU mode
401 if (qureg.isGpuAccelerated)
402 gpu_copyGpuToCpu(qureg); // syncs then overwrites all local CPU amps
403}

Referenced by TEST_CASE(), and TEST_CASE().

◆ syncQuregToGpu()

void syncQuregToGpu ( Qureg qureg)
Note
Documentation for this function or struct is under construction!
Warning
This function has not yet been unit tested and may contain bugs. Please use with caution!

Definition at line 390 of file qureg.cpp.

390 {
391 validate_quregFields(qureg, __func__);
392
393 // permit this to be called even in non-GPU mode
394 if (qureg.isGpuAccelerated)
395 gpu_copyCpuToGpu(qureg); // syncs then overwrites all local GPU amps
396}

◆ syncSubQuregFromGpu()

void syncSubQuregFromGpu ( Qureg qureg,
qindex localStartInd,
qindex numLocalAmps )
Note
Documentation for this function or struct is under construction!
Warning
This function has not yet been unit tested and may contain bugs. Please use with caution!

Definition at line 425 of file qureg.cpp.

425 {
426 validate_quregFields(qureg, __func__);
427 validate_localAmpIndices(qureg, localStartInd, numLocalAmps, __func__);
428
429 // the above validation communicates for node consensus in
430 // distributed settings, because params can differ per-node.
431 // note also this function accepts statevectors AND density
432 // matrices, because the latter does not need a bespoke
433 // (row,col) interface, because the user can only access/modify
434 // local density matrix amps via a flat index anyway!
435
436 // we permit this function to do nothing when not GPU-accelerated
437 if (!qureg.isGpuAccelerated)
438 return;
439
440 // otherwise, every node merely copies its local subset, which
441 // may differ per-node, in an embarrassingly parallel manner
442 gpu_copyGpuToCpu(&qureg.gpuAmps[localStartInd], &qureg.cpuAmps[localStartInd], numLocalAmps);
443}

◆ syncSubQuregToGpu()

void syncSubQuregToGpu ( Qureg qureg,
qindex localStartInd,
qindex numLocalAmps )
Note
Documentation for this function or struct is under construction!
Warning
This function has not yet been unit tested and may contain bugs. Please use with caution!

Definition at line 406 of file qureg.cpp.

406 {
407 validate_quregFields(qureg, __func__);
408 validate_localAmpIndices(qureg, localStartInd, numLocalAmps, __func__);
409
410 // the above validation communicates for node consensus in
411 // distributed settings, because params can differ per-node.
412 // note also this function accepts statevectors AND density
413 // matrices, because the latter does not need a bespoke
414 // (row,col) interface, because the user can only access/modify
415 // local density matrix amps via a flat index anyway!
416
417 // we permit this function to do nothing when not GPU-accelerated
418 if (!qureg.isGpuAccelerated)
419 return;
420
421 // otherwise, every node merely copies its local subset, which
422 // may differ per-node, in an embarrassingly parallel manner
423 gpu_copyCpuToGpu(&qureg.cpuAmps[localStartInd], &qureg.gpuAmps[localStartInd], numLocalAmps);
424}