DPDK: Fix source for tarball #3505

mcgov · 2024-11-11T20:31:24Z

Fix the way the TarDownloader handles dowloading files and identifying their filenames. This required fixing a bug in the Tar tool where Linux would not infer the filename in the default case like the Windows version does. Previously, providing a directory and not a filename would result in the tool identifying an entire directory of files as the filename when the Wget tool fetches a cached result.

This PR fixes that issue by allowing the tool to infer the filename, this fixes the ability to use the Wget tool without force_run when using a default filename.

Fixing one bug uncovered a few others. Fixes the way the Tar tool handles fetching the filename of the tar file it downloads.

lisa/base_tools/wget.py

When running Wget.get(..., force_run=False) and Tar.extract it is useful to allow Tar to skip extracting existing files on the second pass. Allow the skip-old-files option, so Tar.extract will not overwrite existing files in the output directory. Note: it's important to not use this option if you are providing LISA with a default filename for your tarballs. This option could silently allow Tar.extract to not update the contents of a directory with the newer file contents. I don't see anyone using that schema now. My apologies to future devs who find this commit message while debugging that issue.

mcgov · 2024-11-12T17:09:04Z

Note: testing in progress.

squirrelsc · 2024-11-12T17:57:20Z

lisa/base_tools/wget.py

@@ -24,6 +24,10 @@ class Wget(Tool):
    def command(self) -> str:
        return "wget"

+    def _initialize(self, *args: Any, **kwargs: Any) -> None:
+        self.__filename_result_cache: Dict[str, str] = dict()


This variable should be used by subclasses, so one underscore is right.

Call it like _url_file_map. It explains the variable names.

squirrelsc · 2024-11-12T18:05:50Z

lisa/base_tools/wget.py

@@ -45,8 +49,26 @@ def get(
        force_run: bool = False,
        timeout: int = 600,
    ) -> str:
+        if not force_run:


The logic could be simpler.

cached_filename = self.__filename_result_cache.get(url, None) if cached_filename: if force_run: del self.__filename_result_cache[url] else: return cached_filename

squirrelsc · 2024-11-12T18:15:20Z

lisa/tools/tar.py

@@ -48,6 +49,21 @@ def extract(
        if strip_components:
            # optionally strip N top level components from a tar file
            tar_cmd += f" --strip-components={strip_components}"
+
+        if skip_old_files:


How about remove existing extracted files always, or always overwrites? The skip-old-files is hard to use with other tools right.

mcgov added 2 commits November 11, 2024 12:27

urlparse tar fix

fa372b0

Fixing one bug uncovered a few others. Fixes the way the Tar tool handles fetching the filename of the tar file it downloads.

Wget: provide default filename is none provided on Linux

7098abd

mcgov requested review from squirrelsc and LiliDeng as code owners November 11, 2024 20:31

squirrelsc reviewed Nov 11, 2024

View reviewed changes

lisa/base_tools/wget.py Show resolved Hide resolved

mcgov added 3 commits November 11, 2024 17:33

Wget: add caching

e69a23b

Dpdk: re add file_path

fd74060

TarDownloader: enable skip_old_files option in tar extract

a0638eb

mcgov force-pushed the mcgov/source_fix branch from 2bed636 to a0638eb Compare November 12, 2024 17:35

squirrelsc reviewed Nov 12, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DPDK: Fix source for tarball #3505

DPDK: Fix source for tarball #3505

mcgov commented Nov 11, 2024 •

edited

Loading

mcgov commented Nov 12, 2024

squirrelsc Nov 12, 2024 •

edited

Loading

squirrelsc Nov 12, 2024

squirrelsc Nov 12, 2024

DPDK: Fix source for tarball #3505

Are you sure you want to change the base?

DPDK: Fix source for tarball #3505

Conversation

mcgov commented Nov 11, 2024 • edited Loading

mcgov commented Nov 12, 2024

squirrelsc Nov 12, 2024 • edited Loading

Choose a reason for hiding this comment

squirrelsc Nov 12, 2024

Choose a reason for hiding this comment

squirrelsc Nov 12, 2024

Choose a reason for hiding this comment

mcgov commented Nov 11, 2024 •

edited

Loading

squirrelsc Nov 12, 2024 •

edited

Loading