[NGT] Backport - Implement new strategy for Tau Validation at HLT and RECO in Phase-2 by elenavernazza · Pull Request #51152 · cms-sw/cmssw

elenavernazza · 2026-06-09T07:53:02Z

PR description:

This PR introduces a new high-level validation strategy for hadronic tau reconstruction at both HLT and RECO in Phase-2.
The validation framework evaluates the reconstruction performance by matching GEN taus to reconstructed (HLT/RECO) taus and computing the following metrics:

Efficiency = fraction of GEN taus matched to at least one HLT/RECO tau
Fake rate = fraction of HLT/RECO taus not matched to any GEN tau
Split rate = fraction of GEN taus matched to more than one HLT/RECO tau
Duplicate rate = fraction of HLT/RECO taus matched to more than one GEN tau
$p_T$ / mass scale = mean of the response distribution (HLT/RECO divided by GEN)
$p_T$ / mass resolution = sigma/mean of the response distribution (HLT/RECO divided by GEN)
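
As an illustration of the last two definitions, the scale and resolution could be computed from matched GEN/reconstructed pairs roughly as below. This is a minimal sketch and not the code in TauValidator.cc: the function name `response_scale_and_resolution` and the toy values are hypothetical, and sigma is assumed here to be the sample standard deviation of the response, whereas the actual implementation may extract it differently (for example from a fit).

```python
import statistics


def response_scale_and_resolution(reco_values, gen_values):
    """Return (scale, resolution) of the response reco/gen for matched pairs.

    scale      = mean of the response distribution
    resolution = sigma / mean of the response distribution
    (sigma is the sample standard deviation: an assumption of this sketch)
    """
    # Skip pairs with a non-positive GEN value to avoid dividing by zero.
    responses = [r / g for r, g in zip(reco_values, gen_values) if g > 0]
    if len(responses) < 2:
        raise ValueError("need at least two matched pairs")
    scale = statistics.fmean(responses)
    sigma = statistics.stdev(responses)
    return scale, sigma / scale


# Hypothetical matched pairs (pT in GeV), for illustration only.
gen_pt = [40.0, 50.0, 80.0, 100.0]
reco_pt = [38.0, 50.0, 84.0, 95.0]
scale, resolution = response_scale_and_resolution(reco_pt, gen_pt)
print(f"scale = {scale:.3f}, resolution = {resolution:.3f}")
```

The same function applies to the mass response by passing masses instead of $p_T$ values.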

All metrics can be studied as a function of the tau kinematics ($p_T$, $\eta$, $\phi$, mass).
Unlike the low-level validation, which relies on detailed hit-by-hit associations, this high-level validation uses a simplified matching strategy based on a configurable spatial requirement (default: $\Delta R=0.3$). This complementary two-level validation approach was discussed in the Tau Algo Meeting (link).
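
The matching and the four rate definitions above can be sketched as follows. This is only an illustration under stated assumptions: `delta_r`, `matching_metrics` and the toy (eta, phi) values are hypothetical names and numbers, taus are represented as plain (eta, phi) tuples, and all candidates are treated as belonging to a single event. The actual analyzer works on CMSSW collections event by event and may apply additional selections.

```python
import math


def delta_r(eta1, phi1, eta2, phi2):
    """Angular distance, with the phi difference wrapped into [-pi, pi)."""
    dphi = (phi1 - phi2 + math.pi) % (2 * math.pi) - math.pi
    return math.hypot(eta1 - eta2, dphi)


def matching_metrics(gen_taus, reco_taus, max_dr=0.3):
    """Compute the four matching rates for lists of (eta, phi) tuples."""
    gen_matches = [0] * len(gen_taus)    # reco taus matched to each GEN tau
    reco_matches = [0] * len(reco_taus)  # GEN taus matched to each reco tau
    for i, (gen_eta, gen_phi) in enumerate(gen_taus):
        for j, (reco_eta, reco_phi) in enumerate(reco_taus):
            if delta_r(gen_eta, gen_phi, reco_eta, reco_phi) < max_dr:
                gen_matches[i] += 1
                reco_matches[j] += 1

    def fraction(count, total):
        return count / total if total else 0.0

    return {
        "efficiency": fraction(sum(n >= 1 for n in gen_matches), len(gen_taus)),
        "fake_rate": fraction(sum(n == 0 for n in reco_matches), len(reco_taus)),
        "split_rate": fraction(sum(n > 1 for n in gen_matches), len(gen_taus)),
        "duplicate_rate": fraction(sum(n > 1 for n in reco_matches), len(reco_taus)),
    }


# Hypothetical toy event: three GEN taus and four reconstructed taus.
gen = [(0.0, 0.0), (1.0, 3.1), (-2.0, 1.0)]
reco = [(0.05, 0.05), (0.1, -0.1), (1.0, -3.1), (2.0, 2.0)]
metrics = matching_metrics(gen, reco)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

In this toy event the first GEN tau is matched to two reconstructed taus (a split), the second is matched across the phi wrap-around, the third is unmatched, and the last reconstructed tau has no GEN match (a fake).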

Technical changes

This PR includes:

Re-introduction of HLTTauVal and HLTTauPostVal into the Phase-2 HLT validation and post-validation sequences
Re-organization of tau pre-validation, validation, and post-validation in RECO workflows for both Run-3 and Phase-2: adoption of standardized naming conventions, removal of deprecated and unmaintained sequences.

The core of the implementation is the new TauValidator.cc EDAnalyzer, which produces centralised DQM histograms for all the metrics defined above; these can be used for more automated RelVal studies. It supports both HLT and RECO collections and handles both RECO and PAT formats. PAT support is required for tau ID algorithms such as DeepTau, which in the offline reconstruction are available only from the PAT step.
Additionally, plotting utilities are provided to compare:

Different ID raw score cuts
Different ID working points
HLT vs RECO performance
Different ΔR matching thresholds (optional, disabled by default for faster workflows)

Some example plots follow:

All plots for 90K events from a TenTau sample in 150 PU can be found at this link.

Instructions to reproduce the validation plots

Run reconstruction and harvesting from RelVal samples

HLT only (faster):

cmsDriver.py step2 -s L1P2GT,HLT:75e33,VALIDATION:@hltValidation -n -1 --nThreads 0 \
 --conditions auto:phase2_realistic_T35 --datatier GEN-SIM-DIGI-RAW,DQMIO \
 --customise SLHCUpgradeSimulations/Configuration/aging.customise_aging_1000 --eventcontent FEVTDEBUGHLT,DQMIO \
 --geometry ExtendedRun4D110 --era Phase2C17I13M9 --hltProcess HLTX --processName HLTX \
 --filein file:/eos/cms/store/relval/CMSSW_16_1_0_pre2/RelValTenTau_15_500/GEN-SIM-DIGI-RAW/PU_150X_mcRun4_realistic_v1_STD_Run4D110_PU-v1/2590000/01cbb197-f9d0-48a5-a634-c985f3b66373.root --fileout file:step2.root \
 --inputCommands="keep *, drop *_hlt*_*_HLT, drop triggerTriggerFilterObjectWithRefs_l1t*_*_HLT"

cmsDriver.py step3 -s HARVESTING:@hltValidation -n -1 \
 --conditions auto:phase2_realistic_T35 --mc --geometry ExtendedRun4D110 --era Phase2C17I13M9 \
 --filetype DQM --scenario pp --hltProcess HLTX --filein file:step2_inDQM.root --fileout file:step3.root

HLT + RECO (slower):

cmsDriver.py step3 -s RAW2DIGI,RECO,RECOSIM,PAT,VALIDATION:@phase2Validation -n 10 \
 --conditions auto:phase2_realistic_T35 --geometry ExtendedRun4D110 --era Phase2C17I13M9 \
 --datatier DQMIO --eventcontent DQM \
 --customise SLHCUpgradeSimulations/Configuration/aging.customise_aging_1000 \
 --customise_command "process.globalValidationHCAL = cms.Sequence(process.hcalSimHitsValidationSequence + process.hcalSimHitStudy)" \
 --filein file:01cbb197-f9d0-48a5-a634-c985f3b66373.root \
 --fileout file:step3.root 

cmsDriver.py step4 -s HARVESTING:@phase2Validation -n -1 \
 --conditions auto:phase2_realistic_T35 --mc --geometry ExtendedRun4D110 --era Phase2C17I13M9 \
 --filetype DQM --scenario pp --hltProcess HLTX --filein file:step3.root --fileout file:step4.root

Note: Before this PR, the HLT DeepTau ID was not persisted in the FEVTDEBUGHLT event content, so running only the RECO step on an old RelVal sample could result in empty distributions of the ID values at HLT.

Produce plots from DQM output

General validation plots:

run_tau_validation_plots.sh STEP

Set STEP to HLT or RECO, or omit it to run both. By default, the script uses the validation obtained by requiring WP vs Jet > 0 (CutWP_VSjet0); change the folder settings (SUB_DIR, OUTDIR_SUFFIX, LABEL_TEXT) if needed.

Comparison plots (ID / WP / $\Delta R$ scans):

run_tau_comparison_plots.sh MODE STEP

Choose MODE=ID/WP/DeltaR and STEP=HLT/RECO.

PR validation:

This PR has been extensively tested with the commands listed above, based on a TenTau RelVal sample. The configurations based on NGT scouting and HLT timing menus have also been successfully validated.
It has also been successfully validated on a general Phase-2 workflow 34434.0:

runTheMatrix.py -l 34434.0 -w upgrade -j 10 --ibeos --nEvents 10 -i all

The DQM plots are produced correctly, but the tt-bar sample contains only a few GEN taus, so the measured performance is not fully representative.

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

This PR is a backport of #51092, needed to make the validation tool available for Run-3 workflows.

Co-authored-by: Spyros Merianos <spyros.merianos@cern.ch>

cmsbuild · 2026-06-09T07:53:36Z

cms-bot internal usage

cmsbuild · 2026-06-09T07:55:04Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-51152/49677

cmsbuild · 2026-06-09T07:55:29Z

A new Pull Request was created by @elenavernazza for CMSSW_17_0_X.

It involves the following packages:

Configuration/EventContent (operations)
HLTriggerOffline/Common (dqm)
HLTriggerOffline/Tau (dqm)
Validation/Configuration (dqm, simulation)
Validation/RecoTau (dqm)

@civanch, @cmsbuild, @ctarricone, @davidlange6, @fabiocos, @ftenchini, @gabrielmscampos, @kpedro88, @mandrenguyen, @mdhildreth, @rseidita can you please review it and eventually sign? Thanks.
@Martin-Grunewald, @apsallid, @denizsun, @fabiocos, @missirol, @mmusich, @mtosi, @rovere, @salimcerci this is something you requested to watch as well.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

Backported from [NGT] Bug fix for tau prevalidation sequence in Run2 workflow #51198

mmusich · 2026-06-09T08:01:32Z

backport of #51092

mmusich · 2026-06-09T08:01:50Z

@cmsbuild, please test

elenavernazza · 2026-06-11T09:28:58Z

This additional commit contains the bug fix introduced in #51198

mmusich · 2026-06-11T09:29:23Z

test parameters:

workflows = ph2_hlt, 11.0, 281.0, 1311.0

mmusich · 2026-06-11T09:29:33Z

backport of #51198

mmusich · 2026-06-11T09:29:42Z

@cmsbuild, please test

cmsbuild · 2026-06-11T09:29:52Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-51152/49723

cmsbuild · 2026-06-11T09:30:08Z

Pull request #51152 was updated. @civanch, @ctarricone, @davidlange6, @fabiocos, @ftenchini, @gabrielmscampos, @kpedro88, @mandrenguyen, @mdhildreth, @rseidita can you please check and sign again.

cmsbuild · 2026-06-11T12:08:01Z

+1

Size: This PR adds an extra 28KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-da07f6/53858/summary.html
COMMIT: d7968a1
CMSSW: CMSSW_17_0_X_2026-06-10-2300/el8_amd64_gcc13
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/51152/53858/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

You potentially removed 1 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 20 differences found in the comparisons
DQMHistoTests: Total files compared: 70
DQMHistoTests: Total histograms compared: 5054302
DQMHistoTests: Total failures: 53
DQMHistoTests: Total nulls: 3
DQMHistoTests: Total successes: 5054220
DQMHistoTests: Total skipped: 26
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 740142.462 KiB( 69 files compared)
DQMHistoSizes: changed ( 34434.7503,... ): 32181.719 KiB HLT/Tau
DQMHistoSizes: changed ( 34434.0,... ): 32174.297 KiB Tau/TauValidation
DQMHistoSizes: changed ( 34434.0,... ): 0.004 KiB MessageLogger/Errors
DQMHistoSizes: changed ( 34434.0,... ): 0.004 KiB MessageLogger/Warnings
Checked 297 log files, 252 edm output root files, 70 DQM output files
TriggerResults: found differences in 18 / 68 workflows

Max Memory Comparisons exceeding threshold

@cms-sw/core-l2 , I found 39 workflow step(s) with memory usage exceeding the error threshold:


Error: Workflow 34434.0_TTbar_14TeV+Run4D121 step3 max memory diff 71.4 exceeds +/- 30.0 MiB
Error: Workflow 34434.0_TTbar_14TeV+Run4D121 step4 max memory diff 241.2 exceeds +/- 30.0 MiB
Error: Workflow 34434.75_TTbar_14TeV+Run4D121_HLT75e33Timing step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.75_TTbar_14TeV+Run4D121_HLT75e33Timing step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.7503_TTbar_14TeV+Run4D121_HLTHeterogeneousValid step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.7503_TTbar_14TeV+Run4D121_HLTHeterogeneousValid step2 max memory diff 36.0 exceeds +/- 30.0 MiB
Error: Workflow 34434.751_TTbar_14TeV+Run4D121_HLT75e33TimingAlpaka step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.751_TTbar_14TeV+Run4D121_HLT75e33TimingAlpaka step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.7521_TTbar_14TeV+Run4D121_HLT75e33TimingTiclV5TrackLinkGNN step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.7521_TTbar_14TeV+Run4D121_HLT75e33TimingTiclV5TrackLinkGNN step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.753_TTbar_14TeV+Run4D121_HLT75e33TimingLegacyTracking step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.753_TTbar_14TeV+Run4D121_HLT75e33TimingLegacyTracking step3 max memory diff 120.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.754_TTbar_14TeV+Run4D121_HLT75e33TimingLegacyTrackingPatatrackQuads step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.754_TTbar_14TeV+Run4D121_HLT75e33TimingLegacyTrackingPatatrackQuads step3 max memory diff 120.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.755_TTbar_14TeV+Run4D121_HLT75e33TimingLST step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.755_TTbar_14TeV+Run4D121_HLT75e33TimingLST step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.756_TTbar_14TeV+Run4D121_HLT75e33TimingTrimmedTracking step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.756_TTbar_14TeV+Run4D121_HLT75e33TimingTrimmedTracking step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.757_TTbar_14TeV+Run4D121_HLT75e33TimingMkFitFit step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.757_TTbar_14TeV+Run4D121_HLT75e33TimingMkFitFit step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.758_TTbar_14TeV+Run4D121_HLT75e33TimingTiclBarrel step3 max memory diff 120.7 exceeds +/- 30.0 MiB
Error: Workflow 34434.758_TTbar_14TeV+Run4D121_HLT75e33TimingTiclBarrel step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.7591_TTbar_14TeV+Run4D121_HLTPhase2WithNanoValid step2 max memory diff 35.5 exceeds +/- 30.0 MiB
Error: Workflow 34434.77_TTbar_14TeV+Run4D121_NGTScouting step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.77_TTbar_14TeV+Run4D121_NGTScouting step3 max memory diff 120.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.771_TTbar_14TeV+Run4D121_NGTScoutingAll step3 max memory diff 120.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.771_TTbar_14TeV+Run4D121_NGTScoutingAll step2 max memory diff 36.0 exceeds +/- 30.0 MiB
Error: Workflow 34434.773_TTbar_14TeV+Run4D121_NGTScoutingWithNanoValid step2 max memory diff 35.5 exceeds +/- 30.0 MiB
Error: Workflow 34434.774_TTbar_14TeV+Run4D121_L1NGTScoutingWithNanoValid step2 max memory diff 231.8 exceeds +/- 30.0 MiB
Error: Workflow 34434.775_TTbar_14TeV+Run4D121_NGTScoutingCAExtensionMergeT5 step3 max memory diff 120.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.775_TTbar_14TeV+Run4D121_NGTScoutingCAExtensionMergeT5 step2 max memory diff 35.9 exceeds +/- 30.0 MiB
Error: Workflow 34434.911_TTbar_14TeV+Run4D121_DD4hep step4 max memory diff 241.2 exceeds +/- 30.0 MiB
Error: Workflow 34434.911_TTbar_14TeV+Run4D121_DD4hep step3 max memory diff 71.4 exceeds +/- 30.0 MiB
Error: Workflow 34496.0_CloseByPGun_CE_E_Front_120um+Run4D121 step3 max memory diff 71.5 exceeds +/- 30.0 MiB
Error: Workflow 34496.0_CloseByPGun_CE_E_Front_120um+Run4D121 step4 max memory diff 241.2 exceeds +/- 30.0 MiB
Error: Workflow 34500.0_CloseByPGun_CE_H_Coarse_Scint+Run4D121 step3 max memory diff 71.5 exceeds +/- 30.0 MiB
Error: Workflow 34500.0_CloseByPGun_CE_H_Coarse_Scint+Run4D121 step4 max memory diff 241.2 exceeds +/- 30.0 MiB
Error: Workflow 34634.999_TTbar_14TeV+Run4D121PU_PMXS1S2PR step4 max memory diff 71.5 exceeds +/- 30.0 MiB
Error: Workflow 34634.999_TTbar_14TeV+Run4D121PU_PMXS1S2PR step5 max memory diff 241.2 exceeds +/- 30.0 MiB
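
For reviewers who want to condense a report like the one above, the following is a minimal sketch that parses such lines and keeps the largest memory difference per workflow. The function names `parse_memory_errors` and `largest_diff_per_workflow` are hypothetical, the regular expression only assumes the line layout shown in this comment, and the sample text is three lines copied from the report rather than the full list.

```python
import re

# Matches lines of the form shown in the bot report above.
PATTERN = re.compile(
    r"Error: Workflow (\S+) (step\d+) max memory diff ([\d.]+) "
    r"exceeds \+/- ([\d.]+) MiB"
)


def parse_memory_errors(text):
    """Return a list of (workflow, step, diff_in_MiB) tuples."""
    rows = []
    for line in text.splitlines():
        match = PATTERN.search(line)
        if match:
            rows.append((match.group(1), match.group(2), float(match.group(3))))
    return rows


def largest_diff_per_workflow(rows):
    """Map each workflow to the (step, diff) with the largest diff."""
    worst = {}
    for workflow, step, diff in rows:
        if workflow not in worst or diff > worst[workflow][1]:
            worst[workflow] = (step, diff)
    return worst


SAMPLE = """\
Error: Workflow 34434.0_TTbar_14TeV+Run4D121 step3 max memory diff 71.4 exceeds +/- 30.0 MiB
Error: Workflow 34434.0_TTbar_14TeV+Run4D121 step4 max memory diff 241.2 exceeds +/- 30.0 MiB
Error: Workflow 34434.77_TTbar_14TeV+Run4D121_NGTScouting step2 max memory diff 35.9 exceeds +/- 30.0 MiB
"""

rows = parse_memory_errors(SAMPLE)
worst = largest_diff_per_workflow(rows)
for workflow, (step, diff) in sorted(worst.items()):
    print(f"{workflow}: {step} +{diff} MiB")
```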

elenavernazza and others added 13 commits June 2, 2026 09:09

Add back HLTTauVal sequence in Phase2

fe17f52

Add callable python for comparison plots

947cbe8

Add response distributions

7644257

Fix logy and logx options in plotting

0357faa

Add Tau Validation for Offline HPSPFTaus

792da7c

Re-organize plotting and add inverted argument for fake rates

dd0342f

Co-authored-by: Spyros Merianos <spyros.merianos@cern.ch>

Add 2D efficiency plots

9e35c61

Persist HLT DeepTauProducer collection

c77dc37

Add comparison of scale and resolution HLT vs RECO

2219585

Remove DeltaR scanning from default validation

199b9d1

Small code format

61ac9bc

Comment folders in harvesting

18bd7c6

Small plotting changes

6cf1b4e

elenavernazza changed the title ~~[NGT] Implement new strategy for Tau Validation at HLT and RECO in Phase-2~~ [NGT] Backport - Implement new strategy for Tau Validation at HLT and RECO in Phase-2 Jun 9, 2026

cmsbuild added this to the CMSSW_17_0_X milestone Jun 9, 2026

cmsbuild added simulation-pending dqm-pending operations-pending pending-signatures tests-pending orp-pending code-checks-pending labels Jun 9, 2026

cmsbuild added code-checks-approved and removed code-checks-pending labels Jun 9, 2026

elenavernazza mentioned this pull request Jun 9, 2026

[NGT] Implement new strategy for Tau Validation at HLT and RECO in Phase-2 #51092

Merged

cmsbuild added operations-approved tests-approved backport-ok and removed operations-pending tests-started backport labels Jun 9, 2026

Fix tau prevalidation for run2 wf

d7968a1

cmsbuild added operations-pending tests-pending code-checks-pending and removed operations-approved tests-approved code-checks-approved labels Jun 11, 2026

cmsbuild added tests-started backport and removed tests-pending backport-ok labels Jun 11, 2026

cmsbuild added code-checks-approved and removed code-checks-pending labels Jun 11, 2026

cmsbuild added operations-approved tests-approved and removed operations-pending tests-started labels Jun 11, 2026

elenavernazza wants to merge 14 commits into cms-sw:CMSSW_17_0_X from cms-ngt-hlt:TauValidationHLT_17_0_X
