Skip to content

ganeshutah/PLDI25-Array-Workshop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PLDI25-Array-Workshop

  • Read (Download) the paper (PDF)

  • Slides (PDF) are here

  • To reproduce the results in this paper, you'll need GPU-FPX's old version with the hang bug fix (for Sec 3.1-3.3)

  • To try out the new Tensor-Core version of GPU-FPX that has merged the functionality into one tool called "detector.so", do this:

  • To reproduce the SRU results of Sec 3.1, please do the following:

  • To reproduce the results in Sec 3.2, please do the following:

    • The version that does not have any issues is main.cu (given here on this Github)

    • The version that has the issues is main1.cu (given here on this Github)

    • Whichever you are running, compile the file as follows: for main.cu, it is as follows:

    • nvcc -O3 -arch=sm_75 -lineinfo -o cuszp main.cu

    • Here, set arch=... version before running to match your GPU)

    • Then run LD_PRELOAD=detector.so ./cuszp

    • change 'detector' to 'analyzer' to conduct analysis

    • Change to nvcc -O3 -arch=sm_75 -lineinfo -o cuszp1 main1.cu

    • Then run LD_PRELOAD=detector.so ./cuszp1 to see the bug reported in the paper

    • main1.cu differs from main.cu in that its tests have higher coverage

    • Each run of cuszp or cuszp1 will generate a test data file and save the compressed data into the file test_data.cuszp.bin

  • To reproduce the results in Sec 3.3 involving PyBlaz, do the following

    • First install PyBlaz as instructed here:
    • Next, you need to run script3.py which is included in this Github

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •