Felix Erben
dab8c01c3d
added Baryon code
2019-09-16 17:20:54 +01:00
Michael Marshall
2e963d1a78
Fix location of Grid.h and remove reference to QCD namespace
2019-09-16 15:34:47 +01:00
Michael Marshall
bf52e7cc96
Latest BaryonUtils.h from Felix + my fixes
2019-09-13 18:11:10 +01:00
Michael Marshall
61d017d0a5
Merge GPU support (upstream/develop) into distillation branch.
...
This compiles and looks right ... but may need some testing
* develop: (762 commits)
Tensor ambiguous fix
Fix for GCC preprocessor/pragma handling bug
Trips up NVCC for reasons I dont understand on summit
Fix GCC complaint
Zero() change
Force a couple of things to compile on NVCC
Remove debug code
nvcc error suppress
Merge develop
Reduction finished and hopefully fixes CI regression fail on single precisoin and force
Double precision variants for summation accuracy
Update todo list
Freeze the seed
Fix compiling of MSource::Gauss for single precision
Think the reduction is now sorted and cleaned up
Fix force term
Printing improvement
GPU reduction fix and also exit backtrace option
GPU friendly
Simplify the comms benchmark
...
# Conflicts:
# Grid/communicator/SharedMemoryMPI.cc
# Grid/qcd/action/fermion/WilsonKernelsAsm.cc
# Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
# Grid/qcd/smearing/StoutSmearing.h
# Hadrons/Modules.hpp
# Hadrons/Utilities/Contractor.cc
# Hadrons/modules.inc
# tests/forces/Test_dwf_force_eofa.cc
# tests/forces/Test_dwf_gpforce_eofa.cc
2019-09-13 13:30:00 +01:00
gfilaci
a7fa86dc29
MooeeInv improvement for DW EOFA + comments
2019-09-05 12:05:21 +01:00
gfilaci
fdd9b14e82
speed up MooeeInvDag for DWF EOFA
2019-09-02 14:49:51 +01:00
gfilaci
e66669d300
fast MooeeInv for EOFA
2019-09-02 14:26:13 +01:00
gfilaci
0efaf3c4fa
access M5D coeffs through pointers
2019-09-02 11:33:00 +01:00
gfilaci
3ef519aaa4
fast MooeeInv
2019-09-02 11:18:14 +01:00
Peter Boyle
b473405652
Tensor ambiguous fix
2019-08-29 09:36:41 -05:00
Christopher Kelly
114ebb7914
Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions
...
Fixed Lattice::reset not compiling with new Grid explicit memory region handling
Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-08-26 16:47:44 -04:00
Peter Boyle
9b7a6d197f
Fix for GCC preprocessor/pragma handling bug
2019-08-23 14:37:46 +01:00
Peter Boyle
28d6be2a4e
Fix GCC complaint
2019-08-22 18:56:37 +01:00
ad01290545
remove remnants of the namespace QCD
2019-08-19 20:30:33 +01:00
Peter Boyle
be37dfb6f8
Remove debug code
2019-08-15 01:31:40 +01:00
Peter Boyle
e279b2be29
Merge develop
2019-08-14 23:01:59 +01:00
Peter Boyle
48e6efc7c9
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
Grid/qcd/action/fermion/WilsonKernelsAsm.cc
Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h
Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
benchmarks/Benchmark_comms.cc
2019-08-14 18:56:54 +01:00
Peter Boyle
3e49dc8a67
Reduction finished and hopefully fixes CI regression fail on single precisoin and force
2019-08-14 15:18:34 +01:00
Peter Boyle
96ac56cace
Double precision variants for summation accuracy
2019-08-14 13:08:01 +01:00
Peter Boyle
ce97638bac
Think the reduction is now sorted and cleaned up
2019-08-11 11:09:01 +01:00
Peter Boyle
53e3ab4131
Fix force term
2019-08-11 11:06:13 +01:00
Fionn O hOgain
d566637cec
Merge branch 'develop' of github.com:fionnoh/Grid into feature/A2A_current_insertion
2019-08-07 12:11:40 +01:00
Felix Erben
b875edceab
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
...
Conflicts:
Grid/qcd/utils/BaryonUtils.h
Hadrons/Modules/MContraction/Baryon2.hpp
2019-08-05 14:19:43 +01:00
Felix Erben
29df60c0cb
some debugging stuff
2019-08-05 14:10:04 +01:00
Michael Marshall
6f40021842
Fixed compiler errors: TODO: Felix, please validate
2019-08-01 19:57:59 +01:00
Peter Boyle
9cd33a7b9c
Printing improvement
2019-07-31 08:01:24 +01:00
Peter Boyle
639dc1ab21
GPU reduction fix and also exit backtrace option
2019-07-31 01:23:23 +01:00
Peter Boyle
9117f61109
GPU friendly
2019-07-31 01:22:54 +01:00
Felix Erben
e66d48c142
second way to compute baryons - qdp style
2019-07-30 13:46:59 +01:00
Peter Boyle
9dad7a0094
Reproducible reduction and axpy_norm offload from Gianluca.
...
Hopefully get CG running entirely on GPU
2019-07-30 00:14:12 +01:00
Peter Boyle
8c6016f717
Merge pull request #219 from mmphys/feature/include
...
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-07-29 23:08:01 +01:00
Peter Boyle
1282e1067f
Do the force term on the accelerator too. Needed particularly because comms buffers
...
are device memory.
2019-07-29 22:58:35 +01:00
Peter Boyle
275c1c920f
More info dump on error from CUDA
2019-07-26 12:18:53 +01:00
Peter Boyle
fe700a183a
Getting HMC to run
2019-07-26 12:18:29 +01:00
Peter Boyle
34108296cd
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
Grid/simd/Grid_avx512.h
2019-07-20 17:05:35 +01:00
Peter Boyle
76c704b84b
Intrinsics for CLANG are now fixed in v6
2019-07-20 16:52:24 +01:00
Peter Boyle
ce255ec359
Relocate to fix build failure for comms none
2019-07-20 16:37:03 +01:00
Peter Boyle
1c096626cb
Hypercube defaults to on if HPE detected, but override to off possible
2019-07-20 16:06:16 +01:00
Peter Boyle
25ba4c5f80
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
HMC/Mobius2p1fEOFA.cc
tests/forces/Test_rect_force.cc
2019-07-19 11:01:55 +01:00
Peter Boyle
9e926e3fc5
Build fix in develop
2019-07-19 10:01:52 +01:00
Peter Boyle
775eaee199
Fix for suspected Intel 2018.1 compiler bug under O3
2019-07-19 07:57:34 +01:00
Felix Erben
56cefadf9b
gamma matrices as input
2019-07-18 17:46:43 +01:00
Peter Boyle
331f5a53dc
New header
2019-07-18 14:51:09 +01:00
Peter Boyle
a23dc295ac
Remove compiler errors and warnings
2019-07-18 14:47:02 +01:00
ferben
11a8668d19
bugfix in Baryonutils
2019-07-18 14:44:55 +01:00
ferben
cded7670d0
new utils for baryons
2019-07-18 14:29:04 +01:00
ferben
feb029fb66
new utils for baryons
2019-07-18 14:24:16 +01:00
Peter Boyle
08904f830e
Merge develop
2019-07-16 11:59:56 +01:00
Peter Boyle
fa9cd50c5b
Merge branch 'develop' into feature/gpu-port
2019-07-16 11:55:17 +01:00
Peter Boyle
42c1dbb1d1
General local stencil first cut for Patrick force term
2019-07-14 14:04:28 +01:00