Please read the general instructions for this exercise first. Here are the additional instructions specific to this task:
Implement a simple baseline solution for the GPU. Make sure it works correctly and that it is reasonably efficient. Make sure that all performance-critical parts are executed on the GPU; you can do some lightweight preprocessing and postprocessing also on the CPU. Remember to check all CUDA operations for errors. In this task, you are permitted to use single-precision floating point numbers.
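One common way to follow the advice about checking all CUDA operations for errors is to wrap every runtime call in a small helper. The sketch below is only an illustration: the names `check`, `CHECK`, `scale_kernel` and `scale_on_gpu` are hypothetical and not part of the course material, and the kernel just scales an array so that the error-checking pattern has something to guard.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: print a readable message and abort if a CUDA call failed.
static inline void check(cudaError_t err, const char* context) {
    if (err != cudaSuccess) {
        std::fprintf(stderr, "CUDA error: %s: %s\n", context, cudaGetErrorString(err));
        std::exit(EXIT_FAILURE);
    }
}

// Wraps a CUDA runtime call and reports the call text on failure.
#define CHECK(x) check(x, #x)

// Toy kernel (illustration only): multiply each element by a factor.
__global__ void scale_kernel(float* data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        data[i] *= factor;
    }
}

// Copies n floats to the GPU, scales them there, and copies them back.
// Every runtime call and the kernel launch itself are checked.
void scale_on_gpu(float* host, int n, float factor) {
    float* dev = nullptr;
    size_t bytes = static_cast<size_t>(n) * sizeof(float);
    CHECK(cudaMalloc((void**)&dev, bytes));
    CHECK(cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice));

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;  // round up
    scale_kernel<<<blocks, threads>>>(dev, n, factor);
    CHECK(cudaGetLastError());  // launches return no status, so query it explicitly

    // This blocking copy waits for the kernel and also surfaces execution errors.
    CHECK(cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost));
    CHECK(cudaFree(dev));
}
```

Kernel launches do not return an error code, which is why the sketch calls `cudaGetLastError()` right after the launch. Single-precision `float` is used here because the task permits it.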
I will first run all kinds of tests to see that your code works correctly. You can try it out locally by running ./grading test, but please note that your code has to compile and work correctly not only on your own computer but also on our machines.
If all is fine, I will run the benchmarks. You can try it out on your own computer by running ./grading benchmark, but of course the precise running time on your own computer might be different from the performance on our grading hardware.
Name | Operations | Parameters | Input and output |
---|---|---|---|
benchmarks/1a | 1,004,000,000 | nx = 1000, ny = 1000 | the input contains 1000 × 1000 pixels, and the output should contain 1000 × 1000 pixels |
benchmarks/1b | 1,000,995,003 | nx = 999, ny = 999 | the input contains 999 × 999 pixels, and the output should contain 999 × 999 pixels |
benchmarks/1c | 1,007,011,005 | nx = 1001, ny = 1001 | the input contains 1001 × 1001 pixels, and the output should contain 1001 × 1001 pixels |
benchmarks/2a | 16,016,000,000 | nx = 1000, ny = 4000 | the input contains 4000 × 1000 pixels, and the output should contain 4000 × 4000 pixels |
benchmarks/2b | 16,016,000,000 | nx = 1000, ny = 4000 | the input contains 4000 × 1000 pixels, and the output should contain 4000 × 4000 pixels |
Here “operations” is our rough estimate of the minimum number of useful arithmetic operations you will need to perform in this benchmark; the exact number will of course depend on the algorithm you are using.
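As a sanity check on the table, the listed operation counts happen to match a simple closed form in nx and ny. This formula is inferred from the numbers only; the page does not state how the estimate was derived, so treat it as an observation rather than a definition.

```latex
\[
  \text{operations} = n_x \, n_y \, (n_y + 4)
\]
\begin{align*}
  \text{1a:}\quad     & 1000 \cdot 1000 \cdot 1004 = 1{,}004{,}000{,}000 \\
  \text{1b:}\quad     & 999 \cdot 999 \cdot 1003 = 1{,}000{,}995{,}003 \\
  \text{1c:}\quad     & 1001 \cdot 1001 \cdot 1005 = 1{,}007{,}011{,}005 \\
  \text{2a, 2b:}\quad & 1000 \cdot 4000 \cdot 4004 = 16{,}016{,}000{,}000
\end{align*}
```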
In this task your submission will be graded using benchmarks/2b: the input contains 4000 × 1000 pixels, and the output should contain 4000 × 4000 pixels.
The point thresholds are as follows. If you submit your solution no later than Saturday, 31 August 2024, at 23:59:59 (Helsinki), your score will be:
Running time | Points |
---|---|
≤ 5.000 sec | 1 |
≤ 3.000 sec | 2 |
≤ 2.000 sec | 3 |
≤ 1.000 sec | 4 |
≤ 0.500 sec | 5 |
For late submissions you will not get any points.
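To put the running-time thresholds in perspective, they can be converted into a required throughput by dividing the operation estimate for benchmarks/2b by each time limit. This relies on the page's own rough estimate of 16,016,000,000 operations, so the results are order-of-magnitude targets only.

```latex
\begin{align*}
  \text{1 point } (\le 5.000\ \text{s}):\quad  & 16{,}016{,}000{,}000 / 5.0 \approx 3.2 \times 10^{9}\ \text{operations/s} \\
  \text{2 points } (\le 3.000\ \text{s}):\quad & 16{,}016{,}000{,}000 / 3.0 \approx 5.3 \times 10^{9}\ \text{operations/s} \\
  \text{3 points } (\le 2.000\ \text{s}):\quad & 16{,}016{,}000{,}000 / 2.0 \approx 8.0 \times 10^{9}\ \text{operations/s} \\
  \text{4 points } (\le 1.000\ \text{s}):\quad & 16{,}016{,}000{,}000 / 1.0 \approx 1.6 \times 10^{10}\ \text{operations/s} \\
  \text{5 points } (\le 0.500\ \text{s}):\quad & 16{,}016{,}000{,}000 / 0.5 \approx 3.2 \times 10^{10}\ \text{operations/s}
\end{align*}
```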