aboutsummaryrefslogtreecommitdiff
path: root/include
Commit message (Collapse)AuthorAgeFilesLines
* tree_based_check: fixes for debug build.Thorsten Töpper2025-08-141-1/+1
|
* tree_based_check: switch to unsigned valuesThorsten Töpper2025-08-141-7/+7
|
* tree_based_check: filter utility for hash listsThorsten Töpper2025-08-121-1/+1
|
* split_for_sort: Split a given file into bucketsThorsten Töpper2025-08-102-0/+116
The target bucket is decided based on the first X characters of a line. The bucket name gets a prefix defined as argument and can be sorted faster on weak hardware. Note: This is just a split alternative. Real world usage in a shell script with a file in which the first 10 characters are the equal in each line, the following 2 bytes are evaluated for splitting: split_for_sort TMPSFS 12 raw_data.txt for f in TMPSFS ; do sort -o "${f}_sorted" -u "${f}" done \# Rely on the argument resolution to go with lexical order cat TMPSFS*_sorted > sorted_data.txt rm TMPSFS*