Peter Boyle
1e29c59bcc
FFT: cache plans per vobj type across calls
...
Plans are created lazily on the first FFT_dim call and reused for all
subsequent calls on the same FFT object. PlanCreate<vobj>() can be
called explicitly to pre-warm the cache. PlanDestroy() must be called
before switching to a different vobj type; the destructor cleans up any
live plans automatically.
Update Test_fft.cc and Test_fftf.cc to call PlanDestroy() between the
LatticeComplex and LatticeSpinMatrix sections that reuse the same FFT object.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com >
2026-05-19 15:12:10 -04:00
Peter Boyle
fe0db53842
FFT offload to GPU and MUCH faster comms.
...
40x speed up on Frontier
2025-08-21 16:45:38 -04:00
paboyle
9e6a4a4737
Assertion updates to macros (mostly) with backtrace.
...
WIlson flow to include options for DBW2, Iwasaki, Symanzik.
View logging for data assurance
2025-08-07 15:48:38 +00:00
Peter Boyle
eafc150034
Test fft asserts
2024-10-23 16:46:26 -04:00
paboyle
066544281f
Deprecate UVM
2024-09-17 13:34:27 +00:00
Peter Boyle
d201277652
Expose Nc as a compile time configure option.
...
Remove precision option
2020-10-07 13:07:00 -04:00
Peter Boyle
b57a4d32aa
Merge branch 'develop' into feature/gpu-port
2018-12-13 05:11:34 +00:00
Vera Guelpers
04190ee7f3
5D free propagator for DWF and boundary conditions for free propagators
2018-05-03 12:31:36 +01:00
paboyle
285deab432
Coordinate handling GPU friendly. Avoid std::vector
2018-02-24 22:19:28 +00:00
paboyle
98af36217a
Zero changes. (I mean literally)
2018-01-27 23:46:02 +00:00
paboyle
d74c21a386
GLobal edit for QCD namespace removal & NAMESPACE macros
2018-01-15 09:37:58 +00:00
paboyle
6ebf9f15b7
Splitting communicators first cut
2017-06-22 08:14:34 +01:00
portelli
a37e71f362
New automatic implementation of gamma matrices, Meson and SeqGamma are broken
2017-01-23 19:13:43 -08:00
portelli
ec232af851
Photon.h references removed
2016-11-03 14:34:16 +00:00
portelli
2854e601e6
FFT test typo
2016-11-03 14:09:47 +00:00
portelli
ca21003f01
Merge branch 'feature/fft-opt' into feature/feynman-rules
...
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
2016-10-26 18:44:47 +01:00
portelli
14ddf2c234
more FFT optimisations
2016-10-26 17:36:26 +01:00
Azusa Yamaguchi
bca861e112
Note:FFT shoud be GridFFT (Not change yet).
...
Gauge fix with FFt is added (tests/core)
2016-10-25 14:21:48 +01:00
paboyle
db749f103f
Add Wilson, DWF, Overlap feynman rule tests
2016-10-10 23:48:35 +01:00
paboyle
34f887ca1c
Test_fft not complete; preparing for tests of momentum space DWF and Overlap feynman rules but not there yet.
2016-09-26 09:44:36 +01:00
paboyle
62febd2823
Wilson prop test
2016-08-31 00:23:09 +01:00
paboyle
ff6da364e8
FFT double and single precision gives good performance now in multithreaded code.
2016-08-24 15:05:00 +01:00
paboyle
e423a09974
FFT improved and test_FFT passing under MPI 8 processes, 8^4 for LatticeComplexD and LatticeSpinMatrixD
2016-08-18 02:23:21 +01:00
paboyle
17097a93ec
FFTW test ran over 4 mpi processes.
2016-08-17 01:33:55 +01:00