1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85
|
/* StarPU --- Runtime system for heterogeneous multicore architectures.
*
* Copyright (C) 2010-2011 Université de Bordeaux 1
*
* StarPU is free software; you can redistribute it and/or modify
* it under the terms of the GNU Lesser General Public License as published by
* the Free Software Foundation; either version 2.1 of the License, or (at
* your option) any later version.
*
* StarPU is distributed in the hope that it will be useful, but
* WITHOUT ANY WARRANTY; without even the implied warranty of
* MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
*
* See the GNU Lesser General Public License in COPYING.LGPL for more details.
*/
#ifndef __STARPU_EXAMPLE_CG_H__
#define __STARPU_EXAMPLE_CG_H__
#include <starpu.h>
#include <math.h>
#include <common/blas.h>
#ifdef STARPU_USE_CUDA
#include <cuda.h>
#include <cublas.h>
#include <starpu_cuda.h>
#endif
#include <starpu.h>
#define DOUBLE
#ifdef DOUBLE
#define TYPE double
#define GEMV DGEMV
#define DOT DDOT
#define GEMV DGEMV
#define AXPY DAXPY
#define SCAL DSCAL
#define cublasdot cublasDdot
#define cublasscal cublasDscal
#define cublasaxpy cublasDaxpy
#define cublasgemv cublasDgemv
#define cublasscal cublasDscal
#else
#define TYPE float
#define GEMV SGEMV
#define DOT SDOT
#define GEMV SGEMV
#define AXPY SAXPY
#define SCAL SSCAL
#define cublasdot cublasSdot
#define cublasscal cublasSscal
#define cublasaxpy cublasSaxpy
#define cublasgemv cublasSgemv
#define cublasscal cublasSscal
#endif
int dot_kernel(starpu_data_handle_t v1,
starpu_data_handle_t v2,
starpu_data_handle_t s,
unsigned nblocks,
int use_reduction);
int gemv_kernel(starpu_data_handle_t v1,
starpu_data_handle_t matrix,
starpu_data_handle_t v2,
TYPE p1, TYPE p2,
unsigned nblocks,
int use_reduction);
int axpy_kernel(starpu_data_handle_t v1,
starpu_data_handle_t v2, TYPE p1,
unsigned nblocks);
int scal_axpy_kernel(starpu_data_handle_t v1, TYPE p1,
starpu_data_handle_t v2, TYPE p2,
unsigned nblocks);
int copy_handle(starpu_data_handle_t dst,
starpu_data_handle_t src,
unsigned nblocks);
#endif /* __STARPU_EXAMPLE_CG_H__ */
|