Based on the amd-builtin, but explicitly vectorized for all sizes (not just float4), and includes a vectorized double implementation.

Passes piglit (float) tests on pitcairn.

Signed-off-by: Aaron Watry <awatry@gmail.com>
Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu>

llvm-svn: 268708
11 lines
176 B
Common Lisp
#include <clc/clc.h>

#include "math.h"

#ifdef cl_khr_fp64
#pragma OPENCL EXTENSION cl_khr_fp64 : enable
#endif

#define __CLC_BODY <fdim.inc>
#include <clc/math/gentype.inc>
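The file above only selects a body (`fdim.inc`) and asks `gentype.inc` to expand it once per floating-point type, so the arithmetic itself is not visible here. As a rough illustration of what each lane is expected to compute, here is a minimal scalar sketch in plain C. It assumes the usual `fdim` semantics (positive difference, NaN if either operand is NaN); the name `fdim_ref` is hypothetical and is not part of libclc or of this commit.

```c
#include <math.h>

/*
 * Hypothetical scalar reference for fdim, for illustration only.
 * This is NOT the code in fdim.inc; it only mirrors the per-element
 * result under the assumed semantics:
 *   - NaN if either operand is NaN
 *   - x - y if x > y
 *   - +0 otherwise
 */
static float fdim_ref(float x, float y)
{
    if (isnan(x) || isnan(y))
        return NAN;
    return (x > y) ? (x - y) : 0.0f;
}
```

A vectorized version would apply the same rule independently to every component of the vector, which is what the commit message describes as being done explicitly for all vector sizes.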