[Buildroot] [PATCH RESEND v2] ppd-merge: speed up per-package-rsync
Brandon Maier
brandon.maier at collins.com
Wed Jun 12 18:22:46 UTC 2024
From: Brandon Maier via buildroot <buildroot at buildroot.org>
The per-package-rsync stage can add a significant amount of time to
builds. They can also be annoying as the target-finalize and
host-finalize targets are the slowest and run on every `make all`, which
is used frequently for partial rebuilds.
The per-package-rsync is slow because it launches a new rsync for each
source tree, and each rsync must rescan the destination directory and
potentially overwrite files multiple times. We can instead merge all the
rsync calls down into one call, and rsync is smarter about scanning all
the source directories and only copying over the files it needs to.
We feed the source trees to rsync in reverse-order, as this preserves
the original behaviour. I.e. when using multiple rsyncs, the last source
tree would overwrite anything in the destination. Now when using a
single rsync, we put the last tree first as rsync will select the first
file it finds.
This only supports the 'copy' mode, which is used in the finalize step.
The 'hardlink' mode requires specifying each source tree with the
--link-dest flag but enforces a maximum of 20 trees.
Below is a benchmark running the host-finalize target for a build with
200 packages.
Benchmark 1: before copy
Time (mean ± σ): 27.171 s ± 0.777 s [User: 6.170 s, System: 14.830 s]
Range (min … max): 26.343 s … 28.566 s 10 runs
Benchmark 2: after copy
Time (mean ± σ): 6.296 s ± 0.196 s [User: 2.874 s, System: 5.600 s]
Range (min … max): 6.094 s … 6.709 s 10 runs
Summary
after copy ran
4.32 ± 0.18 times faster than before copy
Cc: Herve Codina <herve.codina at bootlin.com>
Cc: Yann E. MORIN <yann.morin.1998 at free.fr>
Cc: Thomas Petazzoni <thomas.petazzoni at bootlin.com>
Reviewed-by: Herve Codina <herve.codina at bootlin.com>
Signed-off-by: Brandon Maier <brandon.maier at collins.com>
---
v1: https://patchwork.ozlabs.org/project/buildroot/patch/20231127224139.35969-1-brandon.maier@collins.com/
v2:
- Simplify logic for 'copy' mode
- Drop support for 'hardlink' mode
---
package/pkg-utils.mk | 18 ++++++------------
1 file changed, 6 insertions(+), 12 deletions(-)
diff --git a/package/pkg-utils.mk b/package/pkg-utils.mk
index d1964299af..3ab9e9fe52 100644
--- a/package/pkg-utils.mk
+++ b/package/pkg-utils.mk
@@ -221,18 +221,12 @@ ifeq ($(BR2_PER_PACKAGE_DIRECTORIES),y)
# $4: literal "copy" or "hardlink" to copy or hardlink files from src to dest
define per-package-rsync
mkdir -p $(3)
- $(foreach pkg,$(1),\
- rsync -a \
- --hard-links \
- $(if $(filter hardlink,$(4)), \
- --link-dest=$(PER_PACKAGE_DIR)/$(pkg)/$(2)/, \
- $(if $(filter copy,$(4)), \
- $(empty), \
- $(error per-package-rsync can only "copy" or "hardlink", not "$(4)") \
- ) \
- ) \
- $(PER_PACKAGE_DIR)/$(pkg)/$(2)/ \
- $(3)$(sep))
+ $(if $(filter hardlink,$(4)), \
+ $(foreach pkg,$(1),\
+ rsync -a --hard-links --link-dest=$(PER_PACKAGE_DIR)/$(pkg)/$(2)/ \
+ $(PER_PACKAGE_DIR)/$(pkg)/$(2)/ $(3)$(sep)), \
+ printf "%s/$(2)/\n" $(1) | tac \
+ | rsync -a --hard-links --files-from=- --no-R -r $(PER_PACKAGE_DIR) $(3))
endef
# prepares the per-package HOST_DIR and TARGET_DIR of the current
---
base-commit: 12b2860d7562d5cf1b7492b58f4f10f0405444c4
change-id: 20240612-ppd-merge-b4-8c623ba7ab90
Best regards,
--
Brandon Maier <brandon.maier at collins.com>
More information about the buildroot
mailing list