James Harrison
|
c5b23c367e
|
QedFVol: Fix segmentation fault when multiple propagator modules are used.
|
2018-02-05 11:46:33 +00:00 |
|
Vera Guelpers
|
b6fe03eb26
|
BugFix: Now the stochatic EM potential weight is generated when calling for the first time
|
2018-02-02 15:29:38 +00:00 |
|
James Harrison
|
f37ed4958b
|
Implement IR improvement, with coefficients set in input file.
|
2018-02-02 11:56:51 +00:00 |
|
paboyle
|
b9b5bdfc3a
|
Proper offload (accelerator access) will require a mutable copy lambda.
|
2018-02-02 11:38:19 +00:00 |
|
paboyle
|
51eb2c5dfc
|
Make referencign the stencil and all info required to evaluate the kernel
accelerator marked up
|
2018-02-02 11:37:13 +00:00 |
|
paboyle
|
ede0dff794
|
Mark up as an accelerator function
|
2018-02-02 11:36:44 +00:00 |
|
paboyle
|
aa6de818e2
|
Copy data needed by Kernels out of the grid object to avoid host reference
|
2018-02-02 11:36:11 +00:00 |
|
paboyle
|
dcf6517a93
|
Accelerator offload and copy Opt into the kernel for GPU host var safety
|
2018-02-02 11:35:35 +00:00 |
|
paboyle
|
a308dff410
|
accelerator loop, copy Opt into the GPU
|
2018-02-02 11:34:37 +00:00 |
|
paboyle
|
14ba20898a
|
Accelerator loop the key kernel call
|
2018-02-02 11:30:07 +00:00 |
|
paboyle
|
a53d3ee19a
|
Add Opt to the lambda capture to get it into the GPU
|
2018-02-02 11:28:39 +00:00 |
|
paboyle
|
5df435319d
|
Use constexpr
|
2018-02-02 11:27:56 +00:00 |
|
paboyle
|
0da2d3e222
|
accelerator off load some more stuff
|
2018-02-02 11:27:35 +00:00 |
|
paboyle
|
9c9dfbfa78
|
Force accelerator
|
2018-02-02 11:25:09 +00:00 |
|
paboyle
|
e4df025d01
|
Accelerator related
|
2018-02-01 23:20:05 +00:00 |
|
paboyle
|
cfeda9d536
|
constexpr on const ints
|
2018-02-01 22:59:12 +00:00 |
|
paboyle
|
4450b1993a
|
Offload
|
2018-02-01 22:45:47 +00:00 |
|
paboyle
|
d03ce5c2a4
|
Provide a way to get around std::vector for a known type on device.
Use template specialisation to access a private member in the Clang++ STL implementation
|
2018-02-01 22:44:25 +00:00 |
|
paboyle
|
7d6522c1ef
|
Accelerator inline
|
2018-02-01 22:43:56 +00:00 |
|
paboyle
|
b96832a922
|
Accelerator inline
|
2018-02-01 22:43:26 +00:00 |
|
paboyle
|
5d7af47b05
|
accelerator_inline
|
2018-02-01 22:42:54 +00:00 |
|
paboyle
|
053ef25c90
|
constexpr makes GPU happy
|
2018-02-01 22:42:29 +00:00 |
|
paboyle
|
8ae77d3706
|
Small simplification of FermionOperatorImpl towards GPU but not there yet
|
2018-02-01 22:41:54 +00:00 |
|
Peter Boyle
|
896f3a8002
|
Fix to MPI for Hokusai system
|
2018-02-01 18:51:51 +00:00 |
|
James Harrison
|
5f85473d6b
|
QedFVol: Move Projection class into Result class
|
2018-02-01 16:16:13 +00:00 |
|
James Harrison
|
ac3b0ebc58
|
QedFVol: New structure for ChargedProp output files
|
2018-02-01 12:31:32 +00:00 |
|
Guido Cossu
|
f0fcdf75b5
|
Update README.md
|
2018-01-30 12:44:20 +01:00 |
|
Guido Cossu
|
53bffb83d4
|
Updating README with new SKL target
|
2018-01-30 12:42:36 +01:00 |
|
Guido Cossu
|
cd44e851f1
|
Fixing compilation error in FundtoHirep
|
2018-01-30 06:04:30 +01:00 |
|
paboyle
|
79b50feacf
|
fixme updates
|
2018-01-29 16:00:40 +00:00 |
|
Guido Cossu
|
fb24e3a7d2
|
Adding utilities for perf profiling
|
2018-01-29 11:11:45 +01:00 |
|
Guido Cossu
|
655a69259a
|
Added support for GCC compilation for Skylake AVX512
|
2018-01-28 17:02:46 +01:00 |
|
paboyle
|
c67c1544cd
|
abs no compile on travis fix attempt
|
2018-01-28 10:26:04 +00:00 |
|
paboyle
|
e657f9a344
|
OMP collapse changes to make NVCC happy
|
2018-01-28 01:21:53 +00:00 |
|
paboyle
|
b6ebf35af5
|
Intel compiler doesn't like Nvidia error disable pragmas
|
2018-01-28 01:03:10 +00:00 |
|
paboyle
|
604c05f4b8
|
parallel_for elimination -> thread_loop
|
2018-01-28 01:01:36 +00:00 |
|
paboyle
|
70e276e1ab
|
parallel_for elimination -> thread_loop
|
2018-01-28 01:01:14 +00:00 |
|
paboyle
|
9472b02771
|
Parallel_for elimination -> thread_loop.
|
2018-01-28 01:00:55 +00:00 |
|
paboyle
|
9597ab94eb
|
Zero changes, swap on lattice type.
|
2018-01-27 23:51:40 +00:00 |
|
paboyle
|
ce4da83bc2
|
Zero changes, literally
|
2018-01-27 23:51:10 +00:00 |
|
paboyle
|
d557f3ef77
|
Zero changes (literally) and also a warning elimination
|
2018-01-27 23:50:43 +00:00 |
|
paboyle
|
f574c20118
|
Zero changes, __VA_ARGS__ and swap
|
2018-01-27 23:50:17 +00:00 |
|
paboyle
|
f102897385
|
VA_ARGS to make comma safe automatic
|
2018-01-27 23:49:47 +00:00 |
|
paboyle
|
d6fce3e498
|
Zero changes, literally
|
2018-01-27 23:48:01 +00:00 |
|
paboyle
|
2d0bcc2606
|
Zero changes, acceleartor on kernels and some thread loop changes
|
2018-01-27 23:47:38 +00:00 |
|
paboyle
|
45df59720e
|
Zero changes and VA_ARGS changes
|
2018-01-27 23:46:58 +00:00 |
|
paboyle
|
44ef5bc207
|
Zero changes (literally speaking).
|
2018-01-27 23:46:28 +00:00 |
|
paboyle
|
98af36217a
|
Zero changes. (I mean literally)
|
2018-01-27 23:46:02 +00:00 |
|
James Harrison
|
4e0cf0cc28
|
QedFVol: Fix bug in ScalarVP.cc due to double use of temporary object. Still getting mpi3 errors when configured with enable-comms=mpi[-auto].
|
2018-01-27 15:15:25 +00:00 |
|
Guido Cossu
|
507c4e9efc
|
Correcting an missing semicolumn in avx512
|
2018-01-27 10:59:55 +01:00 |
|