DART-MPI: call into MPI every once in a while for local put/get by devreal · Pull Request #712 · dash-project/dash

devreal · 2020-06-25T14:06:44Z

We need to call into MPI every once in a while as otherwise polling on a local variable may not trigger progress if that is needed.

codecov · 2020-06-25T14:30:14Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 85.19%. Comparing base (425857a) to head (151d79d).
Report is 124 commits behind head on development.

Additional details and impacted files

@@               Coverage Diff               @@
##           development     #712      +/-   ##
===============================================
+ Coverage        84.06%   85.19%   +1.12%     
===============================================
  Files              336      336              
  Lines            24954    24945       -9     
  Branches         11349    11540     +191     
===============================================
+ Hits             20977    21251     +274     
  Misses            3693     3693              
+ Partials           284        1     -283

Files with missing lines	Coverage Δ
dart-impl/mpi/src/dart_communication.c	`69.44% <ø> (-0.33%)`	⬇️

... and 41 files with indirect coverage changes

dhinf · 2020-06-25T14:34:59Z

dart-impl/mpi/src/dart_communication.c

 #include <alloca.h>

+/* the number of consecutive memcpy for local put/get before calling into MPI */
+#define NUM_CONSECUTIVE_MEMCPY 16


why 16 ? educated guess?

Yes, I'm open to other suggestions :D

at least document, that 16 is just an educated guess

bertwesarg · 2020-06-25T18:07:05Z

dart-impl/mpi/src/dart_communication.c

 #include <alloca.h>

+/* the number of consecutive memcpy for local put/get before calling into MPI */
+#define NUM_CONSECUTIVE_MEMCPY 16


at least document, that 16 is just an educated guess

bertwesarg · 2020-06-25T18:08:57Z

dart-impl/mpi/src/dart_communication.c

+#define NUM_CONSECUTIVE_MEMCPY 16
+
+/* number of performed local memcpy between calling into MPI */
+static _Thread_local int num_local_memcpy = 0;


this needs C11, is this already ensured by CMake? I also think its better to use <threads.h> and thread_local here.

and no = 0 needed

You're right, we don't officially require a C11 compiler. I am not going to open the discussion on whether we should abandon support for >20 year old compilers though...

bertwesarg · 2020-06-25T18:13:07Z

dart-impl/mpi/src/dart_communication.c

+    num_local_memcpy = 0;
    return DART_OK;
  }



get_shared_mem and put_shared_mem also call memcpy, are they also effected by this?

They do not, as that doesn't depend on progress triggered locally.

devreal · 2020-06-26T08:50:22Z

After having given this some more thought I modified the PR to always call into MPI for local memory accesses. If fast local access without progress is required the application should just dereference the native pointer provided by DASH. Once we are in DART we have lost the latency race anyway. Everything else just adds complexity.

bertwesarg · 2020-06-26T09:24:14Z

+1

devreal added the enhancement label Jun 25, 2020

devreal requested a review from bertwesarg June 25, 2020 14:06

dhinf reviewed Jun 25, 2020

View reviewed changes

bertwesarg reviewed Jun 25, 2020

View reviewed changes

DART-MPI: call into MPI for local put/get to ensure progress

151d79d

devreal force-pushed the local-memcpy-interval branch from 6105977 to 151d79d Compare June 26, 2020 08:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DART-MPI: call into MPI every once in a while for local put/get#712

DART-MPI: call into MPI every once in a while for local put/get#712
devreal wants to merge 1 commit intodevelopmentfrom
local-memcpy-interval

devreal commented Jun 25, 2020

Uh oh!

codecov bot commented Jun 25, 2020 •

edited

Loading

Uh oh!

dhinf Jun 25, 2020

Uh oh!

devreal Jun 25, 2020

Uh oh!

bertwesarg Jun 25, 2020

Uh oh!

bertwesarg Jun 25, 2020

Uh oh!

bertwesarg Jun 25, 2020

Uh oh!

bertwesarg Jun 25, 2020

Uh oh!

devreal Jun 26, 2020

Uh oh!

bertwesarg Jun 25, 2020

Uh oh!

devreal Jun 26, 2020

Uh oh!

devreal commented Jun 26, 2020

Uh oh!

bertwesarg commented Jun 26, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

devreal commented Jun 25, 2020

Uh oh!

codecov bot commented Jun 25, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

devreal commented Jun 26, 2020

Uh oh!

bertwesarg commented Jun 26, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Jun 25, 2020 •

edited

Loading