git-mirror

mirror of https://github.com/git/git.git synced 2025-12-23 12:14:22 +01:00

Author	SHA1	Message	Date
Junio C Hamano	3b7671e2ea	Merge branch 'ps/clar-integers' into seen Import newer version of "clar", unit testing framework. Comments? * ps/clar-integers: gitattributes: disable blank-at-eof errors for clar test expectations t/unit-tests: demonstrate use of integer comparison assertions t/unit-tests: update clar to 39f11fe	2025-12-23 12:19:53 +09:00
Patrick Steinhardt	2e53d29f53	t/unit-tests: demonstrate use of integer comparison assertions The clar project has introduced a couple of new assertions that perform relative integer comparisons, like "greater than" or "less or equal". Adapt the reftable-record unit tests to demonstrate their usage. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-12-07 07:25:16 +09:00
Patrick Steinhardt	d5e4aef358	t/unit-tests: update clar to 39f11fe Update clar to commit 39f11fe (Merge pull request #131 from pks-gitlab/pks-integer-double-evaluation, 2025-12-05). This commit includes the following changes relevant to Git: - There are now typesafe integer comparison functions. Furthermore, the range of comparison functions has been included to also have relative comparisons, like "greater than". - There is a new `cl_failf()` macro that allows the caller to specify an error message with formatting directives. - The TAP format has been fixed to correctly terminate YAML blocks with "...\n" instead of "---\n". Note that we already had a `cl_failf()` function declared in our own sources. This function is equivalent to the upstreamed function, so we can simply drop it now. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-12-07 07:25:15 +09:00
Jeff King	b5b6c11a70	parse: add functions for parsing from non-string buffers If you have a buffer that is not NUL-terminated but want to parse an integer, there aren't many good options. If you use strtol() and friends, you risk running off the end of the buffer if there is no non-digit terminating character. And even if you carefully make sure that there is such a character, ASan's strict-string-check mode will still complain. You can copy bytes into a temporary buffer, terminate it, and then call strtol(), but doing so adds some pitfalls (like making sure you soak up whitespace and leading +/- signs, and reporting overflow for overly long input). Or you can hand-parse the digits, but then you need to take some care to handle overflow (and again, whitespace and +/- signs). These things aren't impossible to do right, but it's error-prone to have to do them in every spot that wants to do such parsing. So let's add some functions which can be used across the code base. There are a few choices regarding the interface and the implementation. First, the implementation: - I went with with parsing the digits (rather than buffering and passing to libc functions). It ends up being a similar amount of code because we have to do some parsing either way. And likewise overflow detection depends on the exact type the caller wants, so we either have to do it by hand or write a separate wrapper for strtol(), strtoumax(), and so on. - Unsigned overflow detection is done using the same techniques as in unsigned_add_overflows(), etc. We can't use those macros directly because our core function is type-agnostic (so the caller passes in the max value, rather than us deriving it on the fly). This is similar to how git_parse_int(), etc, work. - Signed overflow detection assumes that we can express a negative value with magnitude one larger than our maximum positive value (e.g., -128..127 for a signed 8-bit value). I doubt this is guaranteed by the standard, but it should hold in practice, and we make the same assumption in git_parse_int(), etc. The nice thing about this is that we can derive the range from the number of bits in the type. For ints, you obviously could use INT_MIN..INT_MAX, but for an arbitrary type, we can use maximum_signed_value_of_type(). - I didn't bother with handling bases other than 10. It would complicate the code, and I suspect it won't be needed. We could probably retro-fit it later without too much work, if need be. For the interface: - What do we call it? We have git_parse_int() and friends, which aim to make parsing less error-prone. And in some ways, these are just buffer (rather than string) versions of those functions. But not entirely. Those functions are aimed at parsing a single user-facing value. So they accept a unit prefix (e.g., "10k"), which we won't always want. And they insist that the whole string is consumed (rather than passing back an "end" pointer). We also have strtol_i() and strtoul_ui() wrappers, which try to make error handling simpler (especially around overflow), but mostly behave like their libc counterparts. These also don't pass out an end pointer, though. So I started a new namespace, "parse_<type>_from_buf". - Like those other functions above, we use an out-parameter to store the result, which lets us return an error code directly. This avoids the complicated errno dance for detecting overflow that you get with strtol(). What should the error code look like? git_parse_int() uses a bool for success/failure. But strtol_ui() uses the syscall-like "0 is success, -1 is error" convention. I went with the bool approach here. Since the names are closest to those functions, I thought it would cause the least confusion. - Unlike git_parse_signed() and friends, we do not insist that the entire buffer be consumed. For parsing a specific standalone string that makes sense, but within an unterminated buffer you are much more likely to be parsing multiple fields from a larger data set. We pass out an "end" pointer the same way strtol() does. Another option is to accept the input as an in-out parameter and advance the pointer ourselves (and likewise shrink the length pointer). That would let you do something like: if (!parse_int_from_buf(&p, &len, &out)) return error(...); /* "p" and "len" were adjusted automatically / if (!len \|\| p++ != ' ') return error(...); That saves a few lines of code in some spots, but requires a few more in others (depending on whether the caller has a length in the first place or is using an end pointer). Of the two callers I intend to immediately convert, we have one of each type! I went with the strtol() approach as flexible and time-tested. - We could likewise take the input buffer as two pointers (start and end) rather than a pointer and a length. That again makes life easier for some callers and harder for others. I stuck with pointer and length as the more usual interface. - What happens when a caller passes in a NULL end pointer? This is allowed by strtol(). But I think it's often a sign of a lurking bug, because there's no way to know how much was consumed (and even if a caller wants to assume everything is consumed, you have no way to verify it). So it is simply an error in this interface (you'd get a segfault). I am tempted to say that if the end pointer is NULL the functions could confirm that the entire buffer was consumed, as a convenience. But that felt a bit magical and surprising. Like git_parse_*(), there is a generic signed/unsigned helper, and then we can add type-specific helpers on top. I've added an int helper here to start, and we'll add more as we convert callers. Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-11-30 10:03:43 -08:00
Junio C Hamano	536d284f3b	Merge branch 'jk/ci-windows-meson-test-fix' "Windows+meson" job at the GitHub Actions CI was hard to debug, as it did not show and save failed test artifacts, which has been corrected. * jk/ci-windows-meson-test-fix: ci(windows-meson-test): handle options and output like other test jobs unit-test: ignore --no-chain-lint	2025-11-26 10:32:43 -08:00
Junio C Hamano	a5d5c50160	Merge branch 'jx/repo-struct-utf8width-fix' The "git repo structure" subcommand tried to align its output but mixed up byte count and display column width, which has been corrected. * jx/repo-struct-utf8width-fix: builtin/repo: fix table alignment for UTF-8 characters t/unit-tests: add UTF-8 width tests for CJK chars	2025-11-24 15:46:41 -08:00
Jeff King	e96105aa17	unit-test: ignore --no-chain-lint In the same spirit as `9faf3963b6` (t: introduce compatibility options to clar-based tests, 2024-12-13), we should ignore --no-chain-lint passed to our clar tests, since it may appear in GIT_TEST_OPTS to be used with other tests. This is particularly important on Windows CI, where --no-chain-lint is added to the test options by default, and the meson build will pass all options to the unit tests. The only reason our meson Windows CI job does not run into this currently is that it is not respecting GIT_TEST_OPTS at all! So ignoring this option is a prerequisite to fixing that situation. Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-11-18 09:45:28 -08:00
Jiang Xin	7a03a10a3a	builtin/repo: fix table alignment for UTF-8 characters The output table from "git repo structure" is misaligned when displaying UTF-8 characters (e.g., non-ASCII glyphs). E.g.: \| 仓库结构 \| 值 \| \| -------------- \| ---- \| \| * 引用 \| \| \| * 计数 \| 67 \| The previous implementation used simple width formatting with printf() which didn't properly handle multi-byte UTF-8 characters, causing misaligned table columns when displaying repository structure information. This change modifies the stats_table_print_structure function to use strbuf_utf8_align() instead of basic printf width specifiers. This ensures proper column alignment regardless of the character encoding of the content being displayed. Also add test cases for strbuf_utf8_align(), a function newly introduced in "builtin/repo.c". Signed-off-by: Jiang Xin <worldhello.net@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-11-16 16:04:24 -08:00
Jiang Xin	878fef8ebf	t/unit-tests: add UTF-8 width tests for CJK chars The file "builtin/repo.c" uses utf8_strwidth() to calculate the display width of UTF-8 characters in a table, but the resulting output is still misaligned. Add test cases for both utf8_strwidth and utf8_strnwidth to verify that they correctly compute the display width for UTF-8 characters. Also updated the build configuration in Makefile and meson.build to include the new test suite in the build process. Signed-off-by: Jiang Xin <worldhello.net@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-11-16 16:04:24 -08:00
Karthik Nayak	e35155588a	reftable/stack: add function to check if optimization is required The reftable backend performs auto-compaction as part of its regular flow, which is required to keep the number of tables part of a stack at bay. This allows it to stay optimized. Compaction can also be triggered voluntarily by the user via the 'git pack-refs' or the 'git refs optimize' command. However, currently there is no way for the user to check if optimization is required without actually performing it. Extract out the heuristics logic from 'reftable_stack_auto_compact()' into an internal function 'update_segment_if_compaction_required()'. Then use this to add and expose `reftable_stack_compaction_required()` which will allow users to check if the reftable backend can be optimized. Signed-off-by: Karthik Nayak <karthik.188@gmail.com> Acked-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-11-10 09:28:47 -08:00
Junio C Hamano	f50f046794	Merge branch 'kn/reftable-consistency-checks' The reftable backend learned to sanity check its on-disk data more carefully. * kn/reftable-consistency-checks: refs/reftable: add fsck check for checking the table name reftable: add code to facilitate consistency checks fsck: order 'fsck_msg_type' alphabetically Documentation/fsck-msgids: remove duplicate msg id reftable: check for trailing newline in 'tables.list' refs: move consistency check msg to generic layer refs: remove unused headers	2025-10-13 22:00:35 -07:00
Karthik Nayak	f644206377	reftable: check for trailing newline in 'tables.list' In the reftable format, the 'tables.list' file contains a newline separated list of tables. While we parse this file, we do not check or care about the last newline. Tighten the parser in `parse_names()` to return an appropriate error if the last newline is missing. This requires modification to `parse_names()` to now return the error while accepting the output as a third argument. Signed-off-by: Karthik Nayak <karthik.188@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-10-07 09:22:57 -07:00
Junio C Hamano	347af012db	Merge branch 'ps/clar-updates' Import a newer version of the clar unit testing framework. * ps/clar-updates: t/unit-tests: update to 10e96bc t/unit-tests: update clar to fcbed04	2025-09-29 11:40:33 -07:00
Patrick Steinhardt	93dbb6b3c5	t/unit-tests: update to 10e96bc Update to 10e96bc (Merge pull request #127 from pks-gitlab/pks-ci-improvements, 2025-09-22). This commit includes a couple of changes: - The GitHub CI has been updated to include a 32 bit CI job. Furthermore, the jobs now compile with "-Werror" and more warnings enabled. - An issue was addressed where `uintptr_t` is not available on NonStop [1]. - The clar selftests have been restructured so that it is now possible to add small test suites more readily. This was done to add tests for the above addressed issue, where we now use "%p" to print pointers in a platform dependent way. - An issue was addressed where the test output had a trailing whitespace with certain output formats, which caused whitespace issues in the test expectation files. [1]: <01c101dc2842$38903640$a9b0a2c0$@nexbridge.com> Reported-by: Randall S. Becker <rsbecker@nexbridge.com> Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-09-22 10:09:03 -07:00
Patrick Steinhardt	e7f04f651a	t/unit-tests: update clar to fcbed04 Update clar to fcbed04 (Merge pull request #123 from pks-gitlab/pks-sandbox-ubsan, 2025-09-10). The most significant changes since the last version include: - Fixed platform support for HP-UX. - Fixes for how clar handles the `-q` flag. - A couple of leak fixes for reported clar errors. - A new `cl_invoke()` function that retains line information. - New infrastructure to create temporary directories. - Improved printing of error messages so that all lines are now properly indented. - Proper selftests for the clar. Most of these changes are somewhat irrelevant to us, but neither do we have to adjust to any of these changes, either. What _is_ interesting to us though is especially the fixed support for HP-UX, and eventually we may also want to use `cl_invoke()`. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-09-11 09:08:49 -07:00
Junio C Hamano	00c2c50ea6	Merge branch 'ps/reftable-libgit2-cleanup' Code clean-ups. * ps/reftable-libgit2-cleanup: refs/reftable: always reload stacks when creating lock reftable: don't second-guess errors from flock interface reftable/stack: handle outdated stacks when compacting reftable/stack: allow passing flags to `reftable_stack_add()` reftable/stack: fix compiler warning due to missing braces reftable/stack: reorder code to avoid forward declarations reftable/writer: drop Git-specific `QSORT()` macro reftable/writer: fix type used for number of records	2025-08-29 09:44:36 -07:00
Junio C Hamano	109c3df14c	Merge branch 'tc/diff-tree-max-depth' "git diff-tree" learned "--max-depth" option. * tc/diff-tree-max-depth: diff: teach tree-diff a max-depth parameter within_depth: fix return for empty path combine-diff: zero memory used for callback filepairs	2025-08-25 14:22:01 -07:00
Junio C Hamano	971ba42dd4	Merge branch 'jc/string-list-split' string_list_split() family of functions have been extended to simplify common use cases. jc/string-list-split: string-list: split-then-remove-empty can be done while splitting string-list: optionally omit empty string pieces in string_list_split() diff: simplify parsing of diff.colormovedws string-list: optionally trim string pieces split by string_list_split() string-list: unify string_list_split* functions string-list: align string_list_split() with its _in_place() counterpart string-list: report programming error with BUG	2025-08-21 13:46:59 -07:00
Patrick Steinhardt	178c588500	reftable/stack: allow passing flags to `reftable_stack_add()` The `reftable_stack_add()` function is a simple wrapper to lock the stack, add records to it via a callback and then commit the result. One problem with it though is that it doesn't accept any flags for creating the addition. This makes it impossible to automatically reload the stack in case it was modified before we managed to lock the stack. Add a `flags` field to plug this gap and pass it through accordingly. For now this new flag won't be used by us, but it will be used by libgit2. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-08-12 07:40:59 -07:00
Toon Claes	2a43e0e550	within_depth: fix return for empty path The within_depth() function is used to check whether pathspecs limited by a max-depth parameter are acceptable. It takes a path to check, a maximum depth, and a "base" depth. It counts the components in the path (by counting slashes), adds them to the base, and compares them to the maximum. However, if the base does not have any slashes at all, we always return `true`. If the base depth is 0, then this is correct; no matter what the maximum is, we are always within it. However, if the base depth is greater than 0, then we might return an erroneous result. This ends up not causing any user-visible bugs in the current code. The call sites in dir.c always pass a base depth of 0, so are unaffected. But tree_entry_interesting() uses this function differently: it will pass the prefix of the current entry, along with a `1` if the entry is a directory, in essence checking whether items inside the entry would be of interest. It turns out not to make a difference in behavior, but the reasoning is complex. Given a tree like: file a/file a/b/file walking the tree and calling tree_entry_interesting() will yield the following results: (with max_depth=0): file: yes a: yes a/file: no a/b: no (with max_depth=1): file: yes a: yes a/file: yes a/b: no So we have inconsistent behavior in considering directories interesting. If they are at the edge of our depth but at the root, we will recurse into them, but then find all of their entries uninteresting (e.g., in the first case, we will look at "a" but find "a/" uninteresting). But if they are at the edge of our depth and not at the root, then we will not recurse (in the second example, we do not even bother entering "a/b"). This turns out not to matter because the only caller which uses max-depth pathspecs is cmd_grep(), which only cares about blob entries. From its perspective, it is exactly the same to not recurse into a subtree, or to recurse and find that it contains no matching entries. Not recursing is merely an optimization. It is debatable whether tree_entry_interesting() should consider such an entry interesting. The only caller does not care if it sees the tree itself, and can benefit from the optimization. But if we add a "max-depth" limiter to regular diffs, then a diff with DIFF_OPT_TREE_IN_RECURSIVE would probably want to show the tree itself, but not what it contains. This patch just fixes within_depth(), which means we consider such entries uninteresting (and makes the current caller happy). If we want to change that in the future, then this fix is still the correct first step, as the current behavior is simply inconsistent. This has the effect the function tree_entry_interesting() now behaves like following on the first example: (with max_depth=0): file: yes a: no a/file: no a/b: no Meaning we won't step in "a/" no more to realize all "a/" entries are uninterested, but we stop at the tree entry itself. Based-on-patch-by: Jeff King <peff@peff.net> Signed-off-by: Toon Claes <toon@iotcl.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-08-07 15:29:34 -07:00
Junio C Hamano	aa4fb2485c	Merge branch 'dl/squelch-maybe-uninitialized' Squelch false-positive compiler warning. * dl/squelch-maybe-uninitialized: t/unit-tests/clar: fix -Wmaybe-uninitialized with -Og remote: bail early from set_head() if missing remote name	2025-08-07 08:14:38 -07:00
Denton Liu	3a7e783d9c	t/unit-tests/clar: fix -Wmaybe-uninitialized with -Og When building with -Og on gcc 15.1.1, the build produces a warning. In practice, though, this cannot be hit because `exact` acts as a guard and that variable can only be set after `matchlen` is already initialized Assign a default value to `matchlen` so that the warning is silenced. Signed-off-by: Denton Liu <liu.denton@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-08-05 08:22:53 -07:00
Junio C Hamano	c4c628f661	Merge branch 'ps/meson-clar-decls-fix' Build fix. * ps/meson-clar-decls-fix: meson: ensure correct "clar-decls.h" header is used	2025-08-04 08:10:34 -07:00
Junio C Hamano	27531efa41	string-list: optionally omit empty string pieces in string_list_split*() Teach the unified split_string() machinery a new flag bit, STRING_LIST_SPLIT_NONEMPTY, to cause empty split pieces to be omitted from the resulting string list. Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-08-02 22:34:45 -07:00
Junio C Hamano	5764549741	string-list: optionally trim string pieces split by string_list_split*() Teach the unified split_string() to take an optional "flags" word, and define the first flag STRING_LIST_SPLIT_TRIM to cause the split pieces to be trimmed before they are placed in the string list. Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-08-02 22:34:32 -07:00
Junio C Hamano	9f6dfe43c8	string-list: align string_list_split() with its _in_place() counterpart The string_list_split_in_place() function was updated by `52acddf3` (string-list: multi-delimiter `string_list_split_in_place()`, 2023-04-24) to take more than one delimiter characters, hoping that we can later use it to replace our uses of strtok(). We however did not make a matching change to the string_list_split() function, which is very similar. Before giving both functions more features in future commits, allow string_list_split() to also take more than one delimiter characters to make them closer to each other. Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-08-02 22:29:27 -07:00
Junio C Hamano	80b80162fd	Merge branch 'sk/reftable-clarify-tests' The reftable unit tests are now ported to the "clar" unit testing framework. * sk/reftable-clarify-tests: t/unit-tests: finalize migration of reftable-related tests t/unit-tests: convert reftable stack test to use clar t/unit-tests: convert reftable record test to use clar t/unit-tests: convert reftable readwrite test to use clar t/unit-tests: convert reftable table test to use clar t/unit-tests: convert reftable pq test to use clar t/unit-tests: convert reftable merged test to use clar t/unit-tests: convert reftable block test to use clar t/unit-tests: convert reftable basics test to use clar test framework t/unit-tests: implement clar specific reftable test helper functions	2025-08-01 11:27:14 -07:00
Patrick Steinhardt	5247da07b8	meson: ensure correct "clar-decls.h" header is used The "clar-decls.h" header gets generated by us to extract prototypes of unit test functions from our clar-based tests. This generated file is then written into "t/unit-tests/" and included via "unit-test.h". The intent of all this is that we can keep "-Wmissing-prototype" warnings enabled. If we had that warning disabled, it would be easy to miss in case any of the non-static functions had a typo in its name and thus wasn't picked up by our test case extractor. Including the file directly has a big downside though: if a source tree was built both with our Makefile and with Meson, then the Meson build would include the "clar-decls.h" file from our Makefile. And if those are out of sync we get compiler errors. We already fixed a similar issue in `4771501c0a` (meson: ensure correct version-def.h is used, 2025-01-14). Let's do the same and pass the absolute path to "clar-decls.h" via a preprocessor define. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-29 08:50:35 -07:00
Junio C Hamano	0f6e5037d4	Merge branch 'rs/pop-recent-commit-with-prio-queue' The pop_most_recent_commit() function can have quite expensive worst case performance characteristics, which has been optimized by using prio-queue data structure. * rs/pop-recent-commit-with-prio-queue: commit: use prio_queue_replace() in pop_most_recent_commit() prio-queue: add prio_queue_replace() commit: convert pop_most_recent_commit() to prio_queue	2025-07-28 12:02:34 -07:00
Seyi Kuforiji	9bbc981c6f	t/unit-tests: finalize migration of reftable-related tests The old `lib-reftable.{c,h}` implemented helper functions for our homegrown unit-testing framework. As part of migrating reftable-related tests to the Clar framework, Clar-specific versions of these functions in `lib-reftable-clar.{c,h}` were introduced. Now that all test files using these helpers have been converted to Clar, we can safely remove the original `lib-reftable.{c,h}` and rename the Clar- specific versions back to `lib-reftable.{c,h}`. This restores a clean and consistent naming scheme for shared test utilities. Finally, update our build system to reflect the changes made and remove redundant code related to the reftable tests and our old homegrown unit-testing setup. `test-lib.{c,h}` remains unchanged in our build system as some files particularly `t/helper/test-example-tap.c` depends on it in order to run, and removing that would be beyond the scope of this patch. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:04 -07:00
Seyi Kuforiji	1cfd187fc1	t/unit-tests: convert reftable stack test to use clar Adapt reftable stack test file to use clar by using clar assertions where necessary. This marks the end of all unit tests migrated away from the `unit-tests/t-.c` pattern, there are no longer any files matching that glob. Remove the sanity check for `t-.c` files to prevent Meson configuration errors during CI and local builds. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:04 -07:00
Seyi Kuforiji	2596bef584	t/unit-tests: convert reftable record test to use clar Adapt reftable record test file to use clar by using clar assertions where necessary. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:03 -07:00
Seyi Kuforiji	ee0a88dadb	t/unit-tests: convert reftable readwrite test to use clar Adapt reftable readwrite test file to use clar by using clar assertions where necessary. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:03 -07:00
Seyi Kuforiji	18a992b7b7	t/unit-tests: convert reftable table test to use clar Adapt reftable table test file to use clar by using clar assertions where necessary. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:03 -07:00
Seyi Kuforiji	a0aaa85c0c	t/unit-tests: convert reftable pq test to use clar Adapt reftable priority queue test file to use clar by using clar assertions where necessary. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:03 -07:00
Seyi Kuforiji	c7784ba600	t/unit-tests: convert reftable merged test to use clar Adapt reftable merged test file to use clar testing framework by using clar assertions where necessary. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:02 -07:00
Seyi Kuforiji	a83bf04d8b	t/unit-tests: convert reftable block test to use clar Adapt reftable block test file to use clar testing framework by using clar assertions where necessary. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:02 -07:00
Seyi Kuforiji	ed5dcbf2f0	t/unit-tests: convert reftable basics test to use clar test framework Adapt reftable basics test file to clar by using clar assertions where necessary.Break up test edge case to improve modularity and clarity. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:02 -07:00
Seyi Kuforiji	5dd5c4e345	t/unit-tests: implement clar specific reftable test helper functions Helper functions defined in `t/unit-tests/lib-reftable.{c,h}` are required for the reftable-related test files to run. In the current implementation these functions are designed to conform with our homegrown unit-testing structure. So in other to convert the reftable test files, there is need for a clar specific implementation of these helper functions. Implement equivalent helper functions in `lib-reftable-clar.{c,h}` to use clar. These functions conform with the clar testing framework and become available for all reftable-related test files implemented using the clar testing framework, which requires them. This will be used by subsequent commits. Signed-off-by: Seyi Kuforiji <kuforiji98@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-24 11:46:01 -07:00
René Scharfe	3d5091d232	prio-queue: add prio_queue_replace() Add a function to replace the top element of the queue that basically does the same as prio_queue_get() followed by prio_queue_put(), but without the work by prio_queue_get() to rebalance the heap. It can be used to optimize loops that get one element and then immediately add another one. That's common e.g., with commit history traversal, where we get out a commit and then put in its parents. Signed-off-by: René Scharfe <l.s.r@web.de> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-22 07:28:35 -07:00
shejialuo	6e5b26c3ff	u-string-list: move "remove duplicates" test to "u-string-list.c" We use "test-tool string-list remove_duplicates" to test the "string_list_remove_duplicates" function. As we have introduced the unit test, we'd better remove the logic from shell script to C program to improve test speed and readability. As all the tests in shell script are removed, let's just delete the "t0063-string-list.sh" and update the "meson.build" file to align with this change. Also we could simply remove "DISABLE_SIGN_COMPARE_WARNINGS" due to we have already deleted related code. Unfortunately, we cannot totally remove "test-string-list.c" due to that we would test the performance of sorting about string list by executing "test-tool string-list sort" in "p0071-sort.sh". Signed-off-by: shejialuo <shejialuo@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-07 08:07:47 -07:00
shejialuo	7e7ce78265	u-string-list: move "filter string" test to "u-string-list.c" We use "test-tool string-list filter" to test the "filter_string_list" function. As we have introduced the unit test, we'd better remove the logic from shell script to C program to improve test speed and readability. Signed-off-by: shejialuo <shejialuo@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-07 08:07:46 -07:00
shejialuo	62c514a9ef	u-string-list: move "test_split_in_place" to "u-string-list.c" We use "test-tool string-list split_in_place" to test the "string_list_split_in_place" function. As we have introduced the unit test, we'd better remove the logic from shell script to C program to improve test speed and readability. Signed-off-by: shejialuo <shejialuo@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-07 08:07:46 -07:00
shejialuo	07d90fda58	u-string-list: move "test_split" into "u-string-list.c" We rely on "test-tool string-list" command to test the functionality of the "string-list". However, as we have introduced clar test framework, we'd better move the shell script into C program to improve speed and readability. Create a new file "u-string-list.c" under "t/unit-tests", then update the Makefile and "meson.build" to build the file. And let's first move "test_split" into unit test and gradually convert the shell script into C program. In order to create `string_list` easily by simply specifying strings in the function call, create "t_vcreate_string_list_dup" function to do this. Then port the shell script tests to C program and remove unused "test-tool" code and tests. Signed-off-by: shejialuo <shejialuo@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-07-07 08:07:46 -07:00
Junio C Hamano	90eedabbf7	Merge branch 'ps/reftable-read-block-perffix' Performance regression in not-yet-released code has been corrected. * ps/reftable-read-block-perffix: reftable: fix perf regression when reading blocks of unwanted type	2025-05-19 16:02:48 -07:00
Jeff King	2744646834	oidmap: rename oidmap_free() to oidmap_clear() This function does not free the oidmap struct itself; it just drops all items from the map (using hashmap_clear_() internally). It should be called oidmap_clear(), per CodingGuidelines. Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-05-12 13:06:26 -07:00
Patrick Steinhardt	1970333644	reftable: fix perf regression when reading blocks of unwanted type In `fd888311fb` (reftable/table: move reading block into block reader, 2025-04-07), we have refactored how reftable blocks are read so that most of the logic is contained in the "block.c" subsystem itself. Most importantly, the whole logic to read the data itself is now contained in that subsystem. This change caused a significant performance regression though when reading blocks that aren't of the specific type one is searching for: Benchmark 1: update-ref: create 100k refs (revision = fd888311fbc~) Time (mean ± σ): 2.171 s ± 0.028 s [User: 1.189 s, System: 0.977 s] Range (min … max): 2.117 s … 2.206 s 10 runs Benchmark 2: update-ref: create 100k refs (revision = `fd888311fb`) Time (mean ± σ): 3.418 s ± 0.030 s [User: 2.371 s, System: 1.037 s] Range (min … max): 3.377 s … 3.473 s 10 runs Summary update-ref: create 100k refs (revision = fd888311fbc~) ran 1.57 ± 0.02 times faster than update-ref: create 100k refs (revision = `fd888311fb`) The root caute of the performance regression is that we changed when exactly blocks of an uninteresting type are being discarded. Previous to the refactoring in the mentioned commit we'd load the block data, read its type, notice that it's not the wanted type and discard the block. After the commit though we don't discard the block immediately, but we fully decode it only to realize that it's not the desired type. We then discard the block again, but have already performed a bunch of pointless work. Fix the regression by making `reftable_block_init()` return early in case the block is not of the desired type. This fixes the performance hit: Benchmark 1: update-ref: create 100k refs (revision = HEAD~) Time (mean ± σ): 2.712 s ± 0.018 s [User: 1.990 s, System: 0.716 s] Range (min … max): 2.682 s … 2.741 s 10 runs Benchmark 2: update-ref: create 100k refs (revision = HEAD) Time (mean ± σ): 1.670 s ± 0.012 s [User: 0.991 s, System: 0.676 s] Range (min … max): 1.652 s … 1.693 s 10 runs Summary update-ref: create 100k refs (revision = HEAD) ran 1.62 ± 0.02 times faster than update-ref: create 100k refs (revision = HEAD~) Note that the baseline performance is lower than in the original due to a couple of unrelated performance improvements that have landed since the original commit. Signed-off-by: Patrick Steinhardt <ps@pks.im> Signed-off-by: Junio C Hamano <gitster@pobox.com>	2025-05-12 10:55:24 -07:00
Junio C Hamano	a819a3da85	Merge branch 'ps/reftable-api-revamp' Overhaul of the reftable API. * ps/reftable-api-revamp: reftable/table: move printing logic into test helper reftable/constants: make block types part of the public interface reftable/table: introduce iterator for table blocks reftable/table: add `reftable_table` to the public interface reftable/block: expose a generic iterator over reftable records reftable/block: make block iterators reseekable reftable/block: store block pointer in the block iterator reftable/block: create public interface for reading blocks git-zlib: use `struct z_stream_s` instead of typedef reftable/block: rename `block_reader` to `reftable_block` reftable/block: rename `block` to `block_data` reftable/table: move reading block into block reader reftable/block: simplify how we track restart points reftable/blocksource: consolidate code into a single file reftable/reader: rename data structure to "table" reftable: fix formatting of the license header	2025-04-29 14:21:30 -07:00
Junio C Hamano	779534d5a7	Merge branch 'sk/clar-trailer-urlmatch-norm-test' A few traditional unit tests have been rewritten to use the clar framework. * sk/clar-trailer-urlmatch-norm-test: t/unit-tests: convert urlmatch-normalization test to clar t/unit-tests: convert trailer test to use clar	2025-04-16 13:54:18 -07:00
Junio C Hamano	7b03646f85	Merge branch 'js/comma-semicolon-confusion' Code clean-up. * js/comma-semicolon-confusion: detect-compiler: detect clang even if it found CUDA clang: warn when the comma operator is used compat/regex: explicitly mark intentional use of the comma operator wildmatch: avoid using of the comma operator diff-delta: avoid using the comma operator xdiff: avoid using the comma operator unnecessarily clar: avoid using the comma operator unnecessarily kwset: avoid using the comma operator unnecessarily rebase: avoid using the comma operator unnecessarily remote-curl: avoid using the comma operator unnecessarily	2025-04-15 13:50:17 -07:00

1 2 3 4 5 ...

297 Commits