Add option to run debug memory backend without issuing errors #4882

Open: victor-decaria-nnl wants to merge 3 commits into master

victor-decaria-nnl (Contributor) commented Jun 5, 2025

This adds a runtime option so that the debug memory backend skips its mprotect calls when MFEM_MMU_SILENT is set in the environment.

The motivation: if you are debugging and suspect that the mprotect logic is causing false positives, you can disable the mprotect calls and still emulate the host-device transfers.

I considered making debug-silent an accepted string, but I wasn't sure where that would plug in, and an environment variable is similar to the way MFEM_MMU_PROTECT_ERROR is currently used.
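
For readers unfamiliar with the idea, the gating described above can be sketched roughly as follows. This is only an illustration under assumptions, not the code in this PR: `MmuSilent`, `MmuProtect`, and `DemoSilent` are hypothetical names, and the environment is read with plain `std::getenv`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <stdlib.h>   // setenv/unsetenv (POSIX)
#include <sys/mman.h> // mprotect, PROT_NONE (POSIX)

// Hypothetical helper: true when MFEM_MMU_SILENT is present in the environment.
inline bool MmuSilent() { return std::getenv("MFEM_MMU_SILENT") != nullptr; }

// Hypothetical wrapper: skip the mprotect call entirely in silent mode,
// otherwise forward to the real system call and return its result.
inline int MmuProtect(void *ptr, std::size_t bytes, int prot)
{
   if (MmuSilent()) { return 0; } // report success without touching the MMU
   return ::mprotect(ptr, bytes, prot);
}

// Usage sketch: with the variable set, the wrapper is a no-op that succeeds.
inline void DemoSilent()
{
   setenv("MFEM_MMU_SILENT", "1", 1);
   assert(MmuProtect(nullptr, 0, PROT_NONE) == 0);
   unsetenv("MFEM_MMU_SILENT");
}
```

In this sketch only the page-protection step is skipped, so any surrounding bookkeeping for the emulated host-device transfers would run unchanged.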

| PR | Author | Editor | Reviewers | Assignment | Approval | Merge |
| --- | --- | --- | --- | --- | --- | --- |
| #4882 | @victor-decaria-nnl | @tzanio | @camierjs + @v-dobrev | 6/8/25 | ⌛due 6/22/25 | ⌛due 6/29/25 |

PR Checklist
  • Code builds.
  • Code passes make style.
  • Update CHANGELOG:
    • Is this a new feature users need to be aware of? New or updated example or miniapp?
    • Does it make sense to create a new section in the CHANGELOG to group with other related features?
  • Update INSTALL:
    • Has a new optional library been added? If so, what range of versions of this library are required? (Make sure the external library is compatible with our BSD license, e.g. it is not licensed under GPL!)
    • Have the version ranges for any required or optional libraries changed?
    • Does make or cmake have a new target?
    • Did the requirements or the installation process change? (rare)
  • Update continuous integration server configurations if necessary (e.g. with new version requirements for each of MFEM's dependencies)
    • .github
    • .appveyor.yml
  • Update .gitignore:
    • Check if make distclean; git status shows any files that were generated from the source by the project (not an IDE) but we don't want to track in the repository.
    • Add new patterns (just for the new files above) and re-run the above test.
  • New examples:
    • All sample runs at the top of the example source file work.
    • Update examples/makefile:
      • Add the example code to the appropriate SEQ_EXAMPLES and PAR_EXAMPLES variables.
      • Add any files generated by it to the clean target.
      • Add the example binary and any files generated by it to the top-level .gitignore file.
    • Update examples/CMakeLists.txt:
      • Add the example code to the ALL_EXE_SRCS variable.
      • Make sure THIS_TEST_OPTIONS is set correctly for the new example.
    • List the new example in doc/CodeDocumentation.dox.
    • If new examples directory (e.g. examples/pumi), list it in doc/CodeDocumentation.conf.in
    • Companion pull request for documentation in mfem/web repo:
      • Update or add example-specific documentation, see e.g. the src/examples.md.
      • Add the description, labels and screenshots in src/examples.md and src/img.
      • In examples.md, list the example under the appropriate categories, add new categories if necessary.
      • Add a short description of the example in the "Extensive Examples" section of features.md.
  • New miniapps:
    • All sample runs at the top of the miniapp source file work.
    • Update top-level makefile and makefile in corresponding miniapp directory.
    • Add the miniapp binary and any files generated by it to the top-level .gitignore file.
    • Update CMake build system:
      • Update the CMakeLists.txt file in the miniapps directory, if the new miniapp is in a new directory.
      • Add/update the CMakeLists.txt file in the new miniapp directory.
      • Consider adding a new test for the new miniapp.
    • List the new miniapp in doc/CodeDocumentation.dox
    • If new miniapps directory (e.g. miniapps/nurbs), add it to MINIAPP_SUBDIRS in the makefile.
    • If new miniapps directory (e.g. miniapps/nurbs), list it in doc/CodeDocumentation.conf.in
    • Companion pull request for documentation in mfem/web repo:
      • Update or add miniapp-specific documentation, see e.g. the src/meshing.md and src/electromagnetics.md files.
      • Add the description, labels and screenshots in src/examples.md and src/img.
      • The miniapps go at the end of the page, and are usually listed only under a specific "Application (PDE)" category.
      • Add a short description of the miniapp in the "Extensive Examples" section of features.md.
  • New capability:
    • All new public, protected, and private classes, methods, data members, and functions have full Doxygen-style documentation in source comments. Documentation should include descriptions of member data, function arguments and return values, template parameters, and prerequisites for calling new functions.
    • Pointer arguments and return values must specify whether ownership is being transferred or lent with the call.
    • Any new functions should include descriptions of their intended use e.g. for internal use only, user-facing, etc., along with references to example code whenever possible/appropriate.
    • Consider adding new sample runs in existing examples to highlight the new capability.
    • Consider saving cool simulation pictures with the new capability in the Confluence gallery (LLNL only) or submitting them, via pull request, to the gallery section of the mfem/web repo.
    • If this is a major new feature, consider mentioning it in the short summary inside README (rare).
    • List major new classes in doc/CodeDocumentation.dox (rare).
  • Update this checklist, if the new pull request affects it.
  • Run make unittest to make sure all unit tests pass.
  • Run the tests in tests/scripts.
  • (LLNL only) After merging:
    • Update internal tests to include the new features.

@victor-decaria-nnl victor-decaria-nnl self-assigned this Jun 5, 2025
@victor-decaria-nnl victor-decaria-nnl marked this pull request as ready for review June 5, 2025 15:53
tzanio (Member) commented Jun 8, 2025

This PR is now under review (see the table in the PR description). To help with the review process, please do not force push to the branch.

@tzanio tzanio requested a review from v-dobrev June 8, 2025 23:35
@tzanio tzanio added the GPU label Jun 8, 2025
@github-project-automation github-project-automation bot moved this to Review Now in Pull Requests Jun 8, 2025
@tzanio tzanio added this to the mfem-4.9 milestone Jun 8, 2025
camierjs (Member) commented Jun 10, 2025

Thank you @victor-decaria-nnl for proposing this option; I can see the use of it.

If we want to silence the MMU and skip all the mprotect calls, we can also use the StdHostMemorySpace and StdDeviceMemorySpace while selecting the memory controllers in NewHostCtrl:

case MT::HOST_DEBUG:
   if (GetEnv("MFEM_MMU_STD")) { return new StdHostMemorySpace(); }
   return new MmuHostMemorySpace();

and

case MT::DEVICE_DEBUG:
   if (GetEnv("MFEM_MMU_STD")) { return new StdDeviceMemorySpace(); }
   return new MmuDeviceMemorySpace();

This approach also skips all the address computations, which gives a good overall speedup.
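
To make the selection pattern concrete outside of MFEM, here is a self-contained sketch. The class names mirror the snippet above, but everything here is a stand-in: the `Name()` method, the reduced `MT` enum, the `DemoSelection` function, and the use of `std::getenv` in place of `GetEnv` are assumptions for illustration only.

```cpp
#include <cassert>
#include <cstdlib>
#include <memory>
#include <stdlib.h> // setenv/unsetenv (POSIX)
#include <string>

// Stand-in interface; the real MFEM classes have a different shape.
struct HostMemorySpace
{
   virtual ~HostMemorySpace() = default;
   virtual std::string Name() const = 0;
};
struct StdHostMemorySpace : HostMemorySpace
{
   std::string Name() const override { return "std"; }
};
struct MmuHostMemorySpace : HostMemorySpace
{
   std::string Name() const override { return "mmu"; }
};

enum class MT { HOST, HOST_DEBUG }; // reduced, hypothetical memory-type enum

// Choose the controller once, at construction time: if MFEM_MMU_STD is set,
// the debug type falls back to the plain allocator and no MMU work is done.
inline std::unique_ptr<HostMemorySpace> NewHostCtrl(MT mt)
{
   if (mt == MT::HOST_DEBUG && !std::getenv("MFEM_MMU_STD"))
   {
      return std::make_unique<MmuHostMemorySpace>();
   }
   return std::make_unique<StdHostMemorySpace>();
}

// Usage sketch: the environment decides which space backs HOST_DEBUG.
inline void DemoSelection()
{
   unsetenv("MFEM_MMU_STD");
   assert(NewHostCtrl(MT::HOST_DEBUG)->Name() == "mmu");
   setenv("MFEM_MMU_STD", "1", 1);
   assert(NewHostCtrl(MT::HOST_DEBUG)->Name() == "std");
   unsetenv("MFEM_MMU_STD");
}
```

Because the choice is made when the controller is created, later allocations pay no per-call environment lookup in this sketch.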
