Peter Boyle
|
c74d11e3d7
|
PVdagM MG
|
2025-02-01 11:04:13 -05:00 |
|
|
c4fc972fec
|
Merge branch 'feature/deprecate-uvm' into develop
|
2025-01-31 16:32:36 +00:00 |
|
|
8cf809e231
|
Best results on Aurora so far
|
2025-01-31 16:14:45 +00:00 |
|
|
94019a922e
|
Significantly better performance on Aurora without using pipeline mode
|
2025-01-30 16:36:46 +00:00 |
|
|
d6b2727f86
|
Pipeline mode getting better -- 2 nodes @ 10TF/s per node on Aurora
|
2025-01-29 09:22:21 +00:00 |
|
|
74a4f43946
|
Optional host buffer bounce for no CUDA aware MPI
|
2025-01-28 15:22:46 +00:00 |
|
|
1caf8b0f86
|
Rename
|
2025-01-28 15:22:37 +00:00 |
|
Peter Boyle
|
3f3661a86f
|
Heading towards PVdagM multigrid
|
2025-01-17 14:33:35 +00:00 |
|
|
8fe429346f
|
Dslash testing for reproduce
|
2024-11-11 23:11:11 +00:00 |
|
Peter Boyle
|
5a4f9bf2e3
|
Force the ROCM version
|
2024-10-29 18:12:31 -04:00 |
|
Peter Boyle
|
b91fc1b6b4
|
Merge branch 'feature/boosted' into feature/deprecate-uvm
Fixed boosted free field test
|
2024-10-28 16:53:09 -04:00 |
|
Peter Boyle
|
eafc150034
|
Test fft asserts
|
2024-10-23 16:46:26 -04:00 |
|
Peter Boyle
|
2877f1a268
|
Verbose reduce
|
2024-10-23 15:14:16 -04:00 |
|
Peter Boyle
|
1e893af775
|
GPU happy
|
2024-10-23 14:52:15 -04:00 |
|
Peter Boyle
|
d9f430a575
|
Happy GPU
|
2024-10-23 14:51:16 -04:00 |
|
Peter Boyle
|
63abe87f36
|
Memory manager verbose improvements that were useful to track an error
|
2024-10-23 14:49:13 -04:00 |
|
Peter Boyle
|
368d649c8a
|
feature/deprecate-uvm happier -- preallocate device resident neigbour table
|
2024-10-23 14:47:55 -04:00 |
|
Peter Boyle
|
5603464f39
|
Fix in partial fraction import/export physical and
make the GPU happier on the deprecate-uvm -- don't use static vectors, make member of class
|
2024-10-23 14:45:58 -04:00 |
|
Peter Boyle
|
655c79f39e
|
Suppress warning on partial override
|
2024-10-23 14:44:41 -04:00 |
|
Peter Boyle
|
565b231c03
|
Nvcc happy
|
2024-10-23 14:44:17 -04:00 |
|
Peter Boyle
|
62a9f180fa
|
NVCC happy
|
2024-10-23 14:44:04 -04:00 |
|
Peter Boyle
|
5ae77876a8
|
Meson field and Aslash field on GPU; some compiler warning removed
|
2024-10-18 19:08:06 -04:00 |
|
Peter Boyle
|
4ed2c2c74f
|
Config command
|
2024-10-18 13:58:33 -04:00 |
|
Peter Boyle
|
955da582b6
|
Working on NVCC
|
2024-10-18 13:58:03 -04:00 |
|
Peter Boyle
|
11b07b950d
|
Vanilla linux compile, assuming spack prerequisites
|
2024-10-18 13:57:40 -04:00 |
|
Peter Boyle
|
8f70cfeda9
|
Clean up
|
2024-10-18 13:56:53 -04:00 |
|
Peter Boyle
|
ce64271048
|
Remove the copying version
|
2024-10-18 13:56:24 -04:00 |
|
|
5cc4f3241d
|
Meson field test
|
2024-10-18 15:42:30 +00:00 |
|
Peter Boyle
|
6815e138b4
|
Boosted fermion attempt
|
2024-10-17 18:37:33 +01:00 |
|
|
a78a61d76f
|
Update configure
|
2024-10-15 14:38:45 +00:00 |
|
|
2eff3f34ed
|
Alternate reduction; default to grids own but make a configure flag
--enable-reduction=grid|mpi
|
2024-10-15 14:36:06 +00:00 |
|
|
03687c1d62
|
Final version of test, closer to original again
|
2024-10-15 14:35:17 +00:00 |
|
|
febfe4e77f
|
Make my own reduction a configure flag
|
2024-10-15 14:32:35 +00:00 |
|
|
4d1aa134b5
|
Use normal reduction, configure flag to force deterministic
|
2024-10-15 14:32:11 +00:00 |
|
|
5ec879860a
|
Odd rounding issue - bears looking into
|
2024-10-15 14:30:54 +00:00 |
|
Peter Boyle
|
f617468e04
|
Update Lattice_base.h
|
2024-10-11 10:39:16 -04:00 |
|
|
b728af903c
|
Fast axpy norm under CFLAG
|
2024-10-11 03:23:09 +00:00 |
|
|
54f1999030
|
axpy_norm_fast -- wasn't using the determinstic MPI sum causing issues
|
2024-10-11 03:22:18 +00:00 |
|
|
fd58f0b669
|
Return ok
|
2024-10-11 03:21:21 +00:00 |
|
|
c5c67b706e
|
cl::sycl -> SYCL
|
2024-10-10 22:04:12 +00:00 |
|
|
be7a543e2c
|
Revert barriers -- these were not the problem
|
2024-10-10 22:03:29 +00:00 |
|
|
68f112d576
|
New software moves cl::sycl
|
2024-10-10 22:03:04 +00:00 |
|
|
ec1395a304
|
Better flight logging
|
2024-10-10 22:01:57 +00:00 |
|
|
beb0e474ee
|
Use deterministic own brand reduction
|
2024-10-10 22:01:24 +00:00 |
|
|
2b5fdcbbc5
|
New software version
|
2024-10-10 21:59:02 +00:00 |
|
|
295127d456
|
Deterministic homebrew reduction
|
2024-10-10 21:58:26 +00:00 |
|
|
7dcfb13694
|
New software stack
|
2024-10-10 21:57:35 +00:00 |
|
Peter Boyle
|
ee4046fe92
|
Added a dimension ordered column sum based reduction for scalar.
Removes dependence on MPI_Allreduce and allows for work around on
systems where this is bollox.
|
2024-09-27 09:26:03 -04:00 |
|
Peter Boyle
|
2a9cfeb9ea
|
New files
|
2024-09-26 14:23:29 -04:00 |
|
Peter Boyle
|
1147b8ea40
|
Cheby poly setup
|
2024-09-26 14:20:32 -04:00 |
|