Add regex patterns for split-archive coverage gaps #5

Merged
CodeX merged 2 commits from feat/regex-rar-coverage into main 2026-04-08 17:53:03 +02:00

2 Commits

Author SHA1 Message Date
CodeX 05f53e4cfc Replace with cleaner 2-pattern regex version
Replace the 5 separate regex patterns with a tighter 2-pattern version
that covers the same rar/zip gaps plus overflow letters (t, u, v, w,
x, y) not caught by the 5-pattern version, and adds 7z multi-volume
coverage. Both patterns are case-insensitive.

  regex:(?i)^.*\.7z\.\d+$     - 7z multi-volume (.7z.001+)
  regex:(?i)^.*\.[r-z]\d+$    - rar volumes, overflow, zip splits

The .partNN.rar pattern is removed because the existing *.rar literal
entry already covers it (ends-with match). The bare .NNN numeric split
pattern is removed because it was outside the stated scope of blocking
rar, zip, and 7z archives.
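
The coverage described above can be checked with a minimal sketch. It uses Python's re module as a stand-in for whatever regex engine the blacklist consumer actually uses. The is_blocked helper, the sample file names, and the treatment of the existing *.rar literal as a case-insensitive ends-with test are assumptions made for illustration, not part of the tool.

```python
import re

# The two patterns from the commit message, with the "regex:" list prefix removed.
PATTERNS = [
    re.compile(r"(?i)^.*\.7z\.\d+$"),   # 7z multi-volume (.7z.001+)
    re.compile(r"(?i)^.*\.[r-z]\d+$"),  # rar volumes, overflow letters, zip splits
]

# Assumption: the existing "*.rar" literal entry acts as a
# case-insensitive ends-with test.
LITERAL_SUFFIXES = [".rar"]


def is_blocked(filename: str) -> bool:
    """Hypothetical helper: True if a literal suffix or either regex matches."""
    lowered = filename.lower()
    if any(lowered.endswith(suffix) for suffix in LITERAL_SUFFIXES):
        return True
    return any(pattern.match(filename) for pattern in PATTERNS)


# Made-up sample names covering each case discussed in the message.
samples = [
    "release.r05", "release.S10", "release.t00", "release.z05",
    "release.7z.001", "release.part01.rar", "release.003", "release.mkv",
]
for name in samples:
    print(f"{name:20} {'blocked' if is_blocked(name) else 'allowed'}")
```

In this sketch release.part01.rar is blocked only through the literal suffix, and release.003 is allowed because the bare numeric pattern was dropped. The second pattern is also broad: any name ending in a dot, a letter from r to z, and digits is blocked, including a hypothetical notes.v2.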
2026-04-08 17:25:40 +02:00
CodeX 80c8abcb12 Add regex patterns for split-archive coverage gaps
Upstream blacklist enumerates rar volume extensions only up to .r04,
.s09, .z04, with the .003 split missing entirely and no coverage for
the .partNN.rar multi-part naming convention. Releases with files like
.r05 and beyond slip through the file-level filter, and Malware
Blocker's queue-level cleanup is too slow to stop qBittorrent from
downloading enough data to incur hit-and-run (HnR) penalties.

Add five regex patterns to the blacklist as manual local additions
preserved by the three-way merge:

  regex:^.*\.\d{3}$           - all 3-digit numeric splits (.000-.999)
  regex:^.*\.part\d+\.rar$    - multi-part rar new style
  regex:^.*\.r\d+$            - all rar volume numbers (old style)
  regex:^.*\.s\d+$            - rar volumes after r99
  regex:^.*\.z\d+$            - zip splits

These patterns are added directly to the blacklist via the
manual-addition escape hatch documented in the README and Lists.md.
The "Whitelist stripped" log line continues to function normally; the
new entries appear under "Custom preserved".
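
The five patterns listed above can be exercised with a short sketch. It again assumes Python's re module as a stand-in for the real matching engine. The labels, the matching_labels helper, and the sample file names are hypothetical and exist only for this illustration.

```python
import re

# The five patterns from this commit, with the "regex:" list prefix removed.
# The dictionary keys are illustrative labels, not names used by the tool.
FIVE_PATTERNS = {
    "numeric split": re.compile(r"^.*\.\d{3}$"),
    "partNN.rar": re.compile(r"^.*\.part\d+\.rar$"),
    "rar old style": re.compile(r"^.*\.r\d+$"),
    "rar after r99": re.compile(r"^.*\.s\d+$"),
    "zip split": re.compile(r"^.*\.z\d+$"),
}


def matching_labels(filename: str) -> list[str]:
    """Hypothetical helper: labels of every pattern that matches the name."""
    return [label for label, pattern in FIVE_PATTERNS.items()
            if pattern.match(filename)]


# Made-up sample names, one per pattern plus an overflow volume.
for name in ["release.r05", "release.s10", "release.z05",
             "release.003", "release.part01.rar", "release.t00"]:
    print(f"{name:20} {matching_labels(name)}")
```

Each of the first five names matches exactly one pattern, while release.t00 returns an empty list. Extensions between .s and .z are not covered by this set.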
2026-04-08 16:45:10 +02:00