January 23, 2014
Director of Software Development
The Two Alternatives
We are going to compare two approaches.
We put together two separate workstations of different capabilities as shown below:
We used the following frame size and parameters for our extraction.
Notes and Comments
The GPU is not giving us an order of magnitude improvement in throughput when compared to a multi-threaded CPU approach. How much of this is due to fundamental limitations and how much is due to our relative inexperience with GPU coding is still unknown. More investigation will be done.
We are not sure why the GPU fails for larger number of frames. It might be due to our programming code as it would appear there is sufficient memory in the GPU to hold both the input bitmap and the resulting frame data.
These times do not include additional time it would take to move the DMD frame results from memory out to the actual mask writer.
The faster and more capable GPU does not crank out results much quicker. This implies to me that a limitation may be in transferring data into the GPU.
It would be good to get seme feedback from mask writer equipment companies as to the required frame rate.
|Industry Players||References and Papers||Hardware and Software Solutions|