Skip to content

Fix PPR sampler memory and labeled homogeneous ABLP#645

Merged
mkolodner-sc merged 31 commits into
mainfrom
mkolodner-sc/ppr_gs_memory
Jun 1, 2026
Merged

Fix PPR sampler memory and labeled homogeneous ABLP#645
mkolodner-sc merged 31 commits into
mainfrom
mkolodner-sc/ppr_gs_memory

Conversation

@mkolodner-sc
Copy link
Copy Markdown
Collaborator

@mkolodner-sc mkolodner-sc commented May 19, 2026

Summary

Makes the PPR sampler cheaper and fixes the labeled-homogeneous ABLP edge case that surfaced while exercising PPR through Graph Store.

Changes include:

  • Precompute PPR total-degree tensors by node type through DistDataset.degree_tensor.
  • Store degree tensors as int32 and share them across sampling workers instead of rebuilding/copying per worker.
  • Update DistPPRNeighborSampler to consume precomputed degree tensors directly.
  • Fix labeled homogeneous ABLP PPR sampling by only passing etype=None for true homogeneous graphs.
  • Fix PPR for GraphStore mode so that memory is shared for the degree tensor
  • Attach single-edge-type PPR outputs directly to homogeneous Data batches.
  • Expand PPR and degree unit coverage.

@mkolodner-sc
Copy link
Copy Markdown
Collaborator Author

/all_tests

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 19:17:28UTC : 🔄 C++ Unit Test started.

@ 19:19:34UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 19:17:29UTC : 🔄 Python Unit Test started.

@ 20:14:17UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 19:17:30UTC : 🔄 E2E Test started.

@ 20:47:24UTC : ❌ Workflow failed.
Please check the logs for more details.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 19:17:30UTC : 🔄 Lint Test started.

@ 19:26:20UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 19:17:31UTC : 🔄 Integration Test started.

@ 20:30:59UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 19:17:31UTC : 🔄 Scala Unit Test started.

@ 19:28:22UTC : ✅ Workflow completed successfully.

@mkolodner-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 20:48:41UTC : 🔄 E2E Test started.

@ 22:34:48UTC : ❌ Workflow failed.
Please check the logs for more details.

@mkolodner-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 21:47:12UTC : 🔄 E2E Test started.

@ 23:07:38UTC : ❌ Workflow failed.
Please check the logs for more details.

@mkolodner-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 28, 2026

GiGL Automation

@ 22:42:25UTC : 🔄 E2E Test started.

@ 24:51:17UTC : ❌ Workflow failed.
Please check the logs for more details.

@mkolodner-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 29, 2026

GiGL Automation

@ 24:24:11UTC : 🔄 E2E Test started.

@ 01:47:47UTC : ❌ Workflow failed.
Please check the logs for more details.

@mkolodner-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 29, 2026

GiGL Automation

@ 24:38:06UTC : 🔄 E2E Test started.

@ 02:04:12UTC : ✅ Workflow completed successfully.

Copy link
Copy Markdown
Collaborator

@kmontemayor2-sc kmontemayor2-sc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks Matt! Did a first pass here, fwiw I feel like this could have been multiple PRs for the different fixes / etc but this pr is fine as-is.

Comment thread gigl/distributed/graph_store/shared_dist_sampling_producer.py Outdated
Comment thread gigl/distributed/utils/degree.py
Comment thread gigl/distributed/utils/degree.py Outdated
Comment thread gigl/distributed/utils/degree.py Outdated
Comment thread gigl/distributed/utils/neighborloader.py
Comment thread gigl/distributed/dist_ppr_sampler.py Outdated
Comment thread tests/unit/distributed/utils/degree_test.py Outdated
Comment thread gigl/distributed/dist_sampling_producer.py
Comment thread gigl/distributed/utils/neighborloader.py Outdated
Comment thread gigl/distributed/dist_dataset.py Outdated
@mkolodner-sc mkolodner-sc marked this pull request as ready for review June 1, 2026 07:20
@mkolodner-sc mkolodner-sc added this pull request to the merge queue Jun 1, 2026
@github-merge-queue github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 1, 2026
@mkolodner-sc mkolodner-sc added this pull request to the merge queue Jun 1, 2026
@mkolodner-sc mkolodner-sc removed this pull request from the merge queue due to a manual request Jun 1, 2026
@mkolodner-sc mkolodner-sc added this pull request to the merge queue Jun 1, 2026
Merged via the queue into main with commit 15b6b45 Jun 1, 2026
7 checks passed
@mkolodner-sc mkolodner-sc deleted the mkolodner-sc/ppr_gs_memory branch June 1, 2026 21:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants