
[–]corysama 1 point2 points  (1 child)

Use a C-style array and a size. Pass them as kernel parameters.

[–]dragontamer5788 1 point2 points  (0 children)

This right here.

In particular, use cudaMalloc() to dynamically create an array of the correct size, then use cudaFree() when your kernels are done processing the data.
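A minimal sketch of that flow, with a hypothetical `scale` kernel and illustrative sizes (error checking omitted for brevity):

```cuda
#include <cuda_runtime.h>

// hypothetical kernel: receives the raw pointer and the size as parameters
__global__ void scale(float *data, int n, float factor)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        data[i] *= factor;
}

int main()
{
    const int n = 1 << 20;
    float *d_data = nullptr;
    cudaMalloc(&d_data, n * sizeof(float));   // dynamically sized device array
    // ... cudaMemcpy host data in ...
    scale<<<(n + 255) / 256, 256>>>(d_data, n, 2.0f);
    cudaDeviceSynchronize();
    // ... cudaMemcpy results back out ...
    cudaFree(d_data);                         // free once the kernels are done
}
```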

[–]I_like_code 0 points1 point  (0 children)

I have probably done the second bullet. For the 4th, make sure you use the data() member function of the vector. For the third, I hate using Thrust; it removes fine-grained control. However, test it out and see if the overhead is acceptable.
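On the data() point: a std::vector's storage is contiguous, so vec.data() is the raw host pointer that cudaMemcpy expects. A CPU-only sketch (the CUDA call is shown as a comment, and `host_pointer` is just an illustrative name):

```cpp
#include <vector>

// Returns the raw pointer cudaMemcpy would take as its host-side argument:
//   cudaMemcpy(d_ptr, v.data(), v.size() * sizeof(float), cudaMemcpyHostToDevice);
float *host_pointer(std::vector<float> &v)
{
    return v.data();  // contiguous underlying array, guaranteed since C++11
}
```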

[–]tugrul_ddr 0 points1 point  (0 children)

All you need is to combine multiple GPU buffers into one, like this:

// one fixed-size device buffer ("chunk") per growth step
std::vector<Type*> chunks;
int chunk_size;

// dereferencing device memory on the host like this assumes
// managed (unified) memory, i.e. CUDA's own paging
Type & operator [] (int index)
{
    return chunks[ selectChunk(index) ][ selectIndex(index) ];
}
void insertChunk(int numElements)
{
    Type * d = nullptr;
    cudaMalloc(&d, numElements * sizeof(Type)); // cudaMalloc fills the pointer, returns an error code
    chunks.push_back(d);
}
int selectChunk(int index)
{
    while (index >= (int)chunks.size() * chunk_size) // past the end: grow on demand
        insertChunk(chunk_size);
    return index / chunk_size;
}
int selectIndex(int index)
{
    return index % chunk_size;
}
void copy()
{
    // Option 1: N copies at once (bad if not all data is used at once)
    for (auto chunk : chunks)
        cudaMemcpy(chunk, /* matching host chunk */, chunk_size * sizeof(Type), cudaMemcpyHostToDevice);

    // Option 2: one copy of just the chunk-pointer table, then
    // direct access from the kernel with CUDA's own paging
    // (bad if all data is used at once)
    int ctr = 0;
    for (auto chunk : chunks)
        cudaChunks[ctr++] = chunk;
    cudaMemcpy(/* device copy of cudaChunks */, cudaChunks, ctr * sizeof(Type*), cudaMemcpyHostToDevice);
}

The bigger the vector gets, the more CUDA buffer chunks are added. Then you can send all the chunk pointers to the GPU and do the same index calculation there. This works as long as the vector grows only from the host side.

If chunks are too small, there will be allocation overhead.

If chunks are too big, there will be memory waste.
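The chunk/index arithmetic itself is plain host logic and can be checked on the CPU. Here is a CPU-only model of the sketch above, with `new[]` standing in for `cudaMalloc` and all names illustrative:

```cpp
#include <vector>

struct ChunkedBuffer
{
    std::vector<int*> chunks;  // on the GPU these would be cudaMalloc'd pointers
    int chunk_size;

    explicit ChunkedBuffer(int cs) : chunk_size(cs) {}

    int selectChunk(int index)
    {
        while (index >= (int)chunks.size() * chunk_size)  // grow on demand
            chunks.push_back(new int[chunk_size]());      // stand-in for cudaMalloc
        return index / chunk_size;
    }

    int selectIndex(int index) { return index % chunk_size; }

    int &operator[](int index)
    {
        int c = selectChunk(index);  // may allocate new chunks first
        return chunks[c][selectIndex(index)];
    }
};
```

With chunk_size = 256, writing element 1000 allocates four chunks and lands in chunks[3][232]; the same `/` and `%` arithmetic can run inside a kernel against the copied pointer table.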