Practical Parallel Programming - Project

Objectives

For a (frequently occurring) computational problem you should develop different parallel solutions, at least the ones discussed in the practica: vectorization, multi-threading and message-passing.

Implement or get the sequential version.
Create naive, parallel implementations with the 3 techniques. Do not optimize, just make sure everything works and the result is correct.
Compare the result of all your parallel implementations with the sequential one. Also measure speedup, performance (flops) and bandwidth. We advice you to make some management code (or script) that automates this!

Start with the MPI-MT code that can be found on the Hydra doc page.

Optimize the naive implementation by trying to overcome the anti-parallel patterns (inefficiencies) it contains.
Study alternative implementations and compare their performance.

Below, in the deliverables section, we described what we expect from the performance study.

Organization

Here are the rules for the project:

The project will be under the guidance of mainly Jan Lemeire (and Nick Wouters).
You can work alone or in groups of two.
The deadline to choose a topic is December 13th, 2024.
Meet us (make an individual appointment which can be via Teams):

once you have the sequential code + ideas about the parallelization. To discuss the ideas and put you on the right track.
somewhere halfway the project to discuss your current problems, to let us give advice and to define the expected end result.

The deadline for the project is 3 days before the oral exam in January.

We expect the following deliverables:

All relevant code related to the project.

sequential code + parallel implementations
the parallel versions should check their result with the one of the sequential version to proof that the outcome is correct!
Remove object and exe-files (and all other intermediate files). Use Build -> clean for instance and remove the .vs folder manually. Minimally, we need source files, and solution (.sln) and project file (.vcxproj).

A short report that describes:
- The problem (brief).
- The different implementations. You can be brief here, since we have your source code. A diagram or scheme might be helpful here.
- Links to sources of information and of source code.
- Description of the parallel system used:
- Most important: a discussion of the performance of the different implementations

Topics

You are free in choosing a topic. For instance, you can parallelize the algorithm of your thesis, or another one that interests you.
For each topic we will give some pointers to problem descriptions together with a number of possible implementations.
We will also try to give an estimation of the difficulty level and feasibility.

Here some suggestions:

Reductions: sum, product, mul-add or max of an array.

Compare with a non-reduction operation having the same number of computations. E.g. compare the global sum with adding a constant to all elements of a vector. The number of operations is the same, except that the reduction needs a specific structure with synchronization.

Sorting. Although an very common algorithm, it will still be interesting to see some results. For vectorization, some special intrinsic functions exist.
Convolutions, like a sobel filter or a Gaussian blur. With large filters good speedups can be achieved.

Variant: simulation of temperature evolution. Sequential code.

Discrete Optimization Problem. Choose a problem, like the shift puzzle explained here. See the theory chapter devoted to it.
Genetic algorithms.

Practical Parallel Programming - Project

Objectives

Organization

Topics

More topics

Topics 2016-2017 (click here)

a) Pattern recognition in signals

c) Solving linear equations