Skip to content

Extract DeletionVector logic from PuffinFile#3491

Open
ebyhr wants to merge 1 commit into
apache:mainfrom
ebyhr:ebi/puffin-refactoring
Open

Extract DeletionVector logic from PuffinFile#3491
ebyhr wants to merge 1 commit into
apache:mainfrom
ebyhr:ebi/puffin-refactoring

Conversation

@ebyhr

@ebyhr ebyhr commented Jun 13, 2026

Copy link
Copy Markdown
Member

Rationale for this change

PuffinFile handles two tasks: format parsing (magic bytes, footer, blobs) and deletion vector domain logic (bitmap deserialization and PyArrow conversion).
This will become problematic when we introduce support for the NDV apache-datasketches-theta-v1 blob in the future.

Are these changes tested?

Yes

Are there any user-facing changes?

Yes - PuffinFile class user needs to call DeletionVector.

@ebyhr ebyhr force-pushed the ebi/puffin-refactoring branch 3 times, most recently from cf9a2ce to 374d25c Compare June 14, 2026 11:49
@ebyhr ebyhr force-pushed the ebi/puffin-refactoring branch from 374d25c to 74e0d7b Compare June 14, 2026 11:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant