Homework Two - OpenMP programming

Due : Tuesday October 13, 2015
Extended Due Date : Tuesday October 20, 2015

Assignment:

Parallelize the program using OpenMP. To do this compile with

gcc -fopenmp mm.c -lgomp -o mm
When compiling and running, you must NOT use gollum itself but one of the nodes node1 to node8. Present the results as a table of run time and speedup (relative to OpenMP on 1 thread). Speedup is defined as T1/Tp, where T1 is the time on 1 thread and Tp is the time on p threads. Comment on the results.
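
The speedup definition above can be turned into a small helper for building the results table. This is only a sketch: the names speedup and print_row are hypothetical, and the times passed in are whatever you measured yourself.

```c
#include <stdio.h>

/* Speedup on p threads relative to the 1-thread OpenMP run: T1 / Tp.
   (Hypothetical helper name, not part of the assignment.) */
double speedup(double t1, double tp)
{
    return t1 / tp;
}

/* Print one table row: thread count, run time in seconds, speedup.
   (Hypothetical helper name.) */
void print_row(int threads, double t1, double tp)
{
    printf("%7d  %12.6f  %8.3f\n", threads, tp, speedup(t1, tp));
}
```

Calling print_row once per thread count, after all timing has finished, produces one row of the table per run.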

The preferred way to time the codes is to use omp_set_num_threads(number) to set the number of threads and omp_get_wtime() to time the relevant part, e.g.

omp_set_num_threads(s);
start = omp_get_wtime();
#pragma omp parallel ...

PUT PARALLEL CODE HERE

end = omp_get_wtime();
printf("time=:%.16g   number of threads: %d\n",end - start, s);
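
One possible way to package the skeleton above is a function that times an arbitrary piece of work. This is a sketch under assumptions: time_region, sample_work and sink are hypothetical names, sample_work is only a stand-in for the real parallel code, and the clock() fallback exists only so the file still builds without -fopenmp (it measures CPU time, not wall time).

```c
#ifdef _OPENMP
#include <omp.h>
#else
#include <time.h>
#endif

/* Hypothetical stand-in for the real parallel work. */
static double sink[1000];

void sample_work(void)
{
    #pragma omp parallel for
    for (int i = 0; i < 1000; i++)
        sink[i] = (double)i * 0.5;
}

/* Run work() on the requested number of threads and return the
   elapsed time in seconds. Nothing is printed inside the timed part. */
double time_region(int threads, void (*work)(void))
{
#ifdef _OPENMP
    omp_set_num_threads(threads);
    double start = omp_get_wtime();
    work();
    double end = omp_get_wtime();
    return end - start;
#else
    /* Fallback when built without -fopenmp: CPU time, single thread. */
    (void)threads;
    clock_t start = clock();
    work();
    return (double)(clock() - start) / CLOCKS_PER_SEC;
#endif
}
```

The returned value can then be printed with the printf line from the skeleton once the call has returned.
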
  1. Implement and check the time to do a matrix-matrix product of a 1500x900 matrix A with a 900x1200 matrix B of doubles using 1, 2, 3, 4, 5, 6, 7 and 8 threads on a gollum node.
    Define Aij = (i+1)*(j+1) and Bij = 1/((double)(i+1)*(double)(j+1)).
    The result matrix C=A*B should be Cij = 900*(double)(i+1)/(double)(j+1). You should check that the result is correct in each case by comparing the computed product A*B with these expected values.
  2. Modify the code to run similar tests using static scheduling with the default chunk size, dynamic scheduling with chunk size 1, and guided scheduling with chunk size 10.
  3. You may turn in either one program that does all 3 types of scheduling or three separate programs.
  4. List the run times and speedups in separate tables for each type of scheduling. Comment on the speedup of each scheduling type and draw conclusions as to which is preferable (and in what circumstances).
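
The definitions and the correctness check in item 1 can be sketched as follows. This is not a reference solution: the function names are hypothetical, the matrices are assumed to be stored row-major in flat arrays supplied by the caller, and the check uses a relative tolerance (an assumption) because the floating-point sum is not guaranteed to match the closed form exactly.

```c
#include <stddef.h>

/* Fill A (n x k) and B (k x m) using the definitions from item 1:
   Aij = (i+1)*(j+1), Bij = 1/((i+1)*(j+1)). */
void fill_inputs(double *A, double *B, int n, int k, int m)
{
    for (int i = 0; i < n; i++)
        for (int j = 0; j < k; j++)
            A[(size_t)i * k + j] = (double)(i + 1) * (double)(j + 1);
    for (int i = 0; i < k; i++)
        for (int j = 0; j < m; j++)
            B[(size_t)i * m + j] = 1.0 / ((double)(i + 1) * (double)(j + 1));
}

/* C = A * B. The rows of C are divided among the threads; each
   thread writes only its own rows, so no synchronization is needed. */
void multiply(const double *A, const double *B, double *C,
              int n, int k, int m)
{
    #pragma omp parallel for
    for (int i = 0; i < n; i++)
        for (int j = 0; j < m; j++) {
            double sum = 0.0;
            for (int l = 0; l < k; l++)
                sum += A[(size_t)i * k + l] * B[(size_t)l * m + j];
            C[(size_t)i * m + j] = sum;
        }
}

/* Count entries of C that differ from the expected value
   k*(i+1)/(j+1) by more than rel_tol (relative). */
int count_errors(const double *C, int n, int k, int m, double rel_tol)
{
    int errors = 0;
    for (int i = 0; i < n; i++)
        for (int j = 0; j < m; j++) {
            double expected = (double)k * (double)(i + 1) / (double)(j + 1);
            double diff = C[(size_t)i * m + j] - expected;
            if (diff < 0.0)
                diff = -diff;
            if (diff > rel_tol * expected)
                errors++;
        }
    return errors;
}
```

For the assignment sizes the calls would use n = 1500, k = 900 and m = 1200, which gives the factor 900 in the expected value; a run is correct when count_errors returns 0.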

    Note that to implement this program you will need to increase the default stack size for both the original thread and the OMP threads. If you do not, you will get a segmentation fault. To do this you need to execute shell commands similar to:

    ulimit -s unlimited
    export OMP_STACKSIZE=16M
    

    Note: remember not to print anything inside the timed region.
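
The three scheduling variants in item 2 differ only in the schedule clause on the parallel loop. The sketch below shows one way to select among them in a single program; the enum, the function names and the flat row-major layout are assumptions made for illustration, not requirements of the assignment.

```c
#include <stddef.h>

/* Hypothetical selector for the three schedules named in item 2. */
enum sched_kind { SCHED_STATIC, SCHED_DYNAMIC_1, SCHED_GUIDED_10 };

/* Compute row i of C = A * B (A is n x k, B is k x m, row-major). */
static void compute_row(const double *A, const double *B, double *C,
                        int i, int k, int m)
{
    for (int j = 0; j < m; j++) {
        double sum = 0.0;
        for (int l = 0; l < k; l++)
            sum += A[(size_t)i * k + l] * B[(size_t)l * m + j];
        C[(size_t)i * m + j] = sum;
    }
}

/* Same product three ways; only the schedule clause changes. */
void multiply_sched(const double *A, const double *B, double *C,
                    int n, int k, int m, enum sched_kind kind)
{
    if (kind == SCHED_STATIC) {
        /* static, default chunk size */
        #pragma omp parallel for schedule(static)
        for (int i = 0; i < n; i++)
            compute_row(A, B, C, i, k, m);
    } else if (kind == SCHED_DYNAMIC_1) {
        /* dynamic, chunk size 1 */
        #pragma omp parallel for schedule(dynamic, 1)
        for (int i = 0; i < n; i++)
            compute_row(A, B, C, i, k, m);
    } else {
        /* guided, chunk size 10 */
        #pragma omp parallel for schedule(guided, 10)
        for (int i = 0; i < n; i++)
            compute_row(A, B, C, i, k, m);
    }
}
```

Timing each variant separately for 1 to 8 threads gives the three tables asked for in item 4.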

This assignment is an individual assignment, to be done on your own without help from other students in the class. However, you may use any materials from any written resource, including web resources.

Instructions for submitting the homework using svn are contained in this file.