Misplaced Pages

GNU parallel

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Shell tool for executing jobs in parallel
This article relies excessively on references to primary sources. Please improve this article by adding secondary or tertiary sources.
Find sources: "GNU parallel" – news · newspapers · books · scholar · JSTOR (September 2015) (Learn how and when to remove this message)
Parallel
Developer(s)GNU Project
Stable release20241022 / 21 October 2024; 2 months ago (2024-10-21)
Repository
Written inPerl
Operating systemGNU
TypeUtility
LicenseGPLv3
Websitewww.gnu.org/software/parallel/ Edit this on Wikidata

GNU parallel is a command-line utility for Linux and other Unix-like operating systems which allows the user to execute shell scripts or commands in parallel. GNU parallel is free software, written by Ole Tange in Perl. It is available under the terms of GPLv3.

Usage

Introduction video, Part 1
Introduction video, Part 2

The most common usage is to replace the shell loop, for example

while read x; do 
    do_something "$x"
done < list

to the form of

cat list | parallel do_something

where the file list contains arguments for do_something and where process_output may be empty.

Scripts using parallel are often easier to read than scripts using pexec.

The program parallel features also

  • grouping of standard output and standard error so the output of the parallel running jobs do not run together;
  • retaining the order of output to remain the same order as input;
  • dealing nicely with filenames containing special characters such as space, single quote, double quote, ampersand, and UTF-8 encoded characters;

By default, parallel runs as many jobs in parallel as there are CPU cores.

Examples

find . -name "*.foo" | parallel grep bar

The above is the parallel equivalent to:

find . -name "*.foo" -exec grep bar {} +

This searches in all files in the current directory and its subdirectories whose name end in .foo for occurrences of the string bar. The parallel command will work as expected unless a file name contains a newline. In order to avoid this limitation one may use:

find . -name "*.foo" -print0 | parallel -0 grep bar

The above command uses the null character to delimit file names.

find . -name "*.foo" | parallel -X mv {} /tmp/trash

The above command expands {} with as many arguments as the command line length permits, distributing them evenly among parallel jobs if required. This can lower process overhead for short-lived commands that take less time to finish than they do to launch.

find . -maxdepth 1 -type f -name "*.ogg" | parallel -X -r cp -v -p {} /home/media

The command above does the same as:

cp -v -p *.ogg /home/media

However, the former command which uses find/parallel/cp is more resource efficient and will not halt with an error if the expansion of *.ogg is too large for the shell.

See also

References

  1. Tange, Ole (21 Oct 2024). "GNU Parallel 20241022 ('Sinwar Nasrallah') released [stable]". parallel (Mailing list).
  2. "GNU Parallel". GNU.org.

External links

GNU Project
History
Licenses
Software
Contributors
Other topics
Categories:
GNU parallel Add topic