[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

GNU Parallel 20141122 ('Rosetta') released

From: Ole Tange
Subject: GNU Parallel 20141122 ('Rosetta') released
Date: Sat, 22 Nov 2014 21:45:24 +0100

GNU Parallel 20141122 ('Rosetta') has been released. It is available
for download at:

Haiku of the month:

Hadoop bit too much?
Want a simpler syntax now?
Use GNU Parallel.
  -- Ole Tange

A central piece of command generation was rewritten making this
release beta quality. As always it passes the testsuite, so most
functionality clearly works.

New in this release:

* Remote systems can be divided into hostgroups (e.g. web and db) by
prepending '@groupname/' to the sshlogin. Multiple groups can be given
by separating groups with '+'. E.g. @web/www1 @web+db/www2 @db/mariadb

* Remote execution can be restricted to servers that are part of one
or more groups by '@groupname' as an sshlogin. Multiple groups can be
given by separating groups with '+'. E.g. -S @web or -S @db+web

* With --hostgroup you can restrict arguments to certain hostgroups by
appending '@groupname' to the argument.  Multiple groups can be given
by separating groups with '+'. E.g. my_web_arg@web
db-or-web-arg@db+web db-only-arg@db Thanks to Michel Courtine for
developing a prototype for this.

* HTSeq-Hadoop: Extending HTSeq for Massively Parallel Sequencing Data
Analysis using Hadoop

* GNU Parallel was cited in: SlideToolkit: An Assistive Toolset for
the Histological Quantification of Whole Slide Images

* GNU Parallel was cited in: Exploring a multiprocessor design space
to analyze the impact of using STT-RAM in the memory hierarchy

* Command-Line OCR with Tesseract on Mac OS X

* Bug fixes and man page updates.

GNU Parallel - For people who live life in the parallel lane.

= About GNU Parallel =

GNU Parallel is a shell tool for executing jobs in parallel using one
or more computers. A job is can be a single command or a small script
that has to be run for each of the lines in the input. The typical
input is a list of files, a list of hosts, a list of users, a list of
URLs, or a list of tables. A job can also be a command that reads from
a pipe. GNU Parallel can then split the input and pipe it into
commands in parallel.

If you use xargs and tee today you will find GNU Parallel very easy to
use as GNU Parallel is written to have the same options as xargs. If
you write loops in shell, you will find GNU Parallel may be able to
replace most of the loops and make them run faster by running several
jobs in parallel. GNU Parallel can even replace nested loops.

GNU Parallel makes sure output from the commands is the same output as
you would get had you run the commands sequentially. This makes it
possible to use output from GNU Parallel as input for other programs.

You can find more about GNU Parallel at:

You can install GNU Parallel in just 10 seconds with: (wget -O - || curl | bash

Watch the intro video on

Walk through the tutorial (man parallel_tutorial). Your commandline
will love you for it.

When using programs that use GNU Parallel to process data for
publication please cite:

O. Tange (2011): GNU Parallel - The Command-Line Power Tool, ;login:
The USENIX Magazine, February 2011:42-47.

If you like GNU Parallel:

* Give a demo at your local user group/team/colleagues
* Post the intro videos on Reddit/Diaspora*/forums/blogs/ lists
* Get the merchandise
* Request or write a review for your favourite blog or magazine
* Request or build a package for your favourite distribution (if it is
not already there)
* Invite me for your next conference

If you use GNU Parallel for research:

* Please cite GNU Parallel in you publications (use --bibtex)

If GNU Parallel saves you money:

* (Have your company) donate to FSF

= About GNU SQL =

GNU sql aims to give a simple, unified interface for accessing
databases through all the different databases' command line clients.
So far the focus has been on giving a common way to specify login
information (protocol, username, password, hostname, and port number),
size (database and table size), and running queries.

The database is addressed using a DBURL. If commands are left out you
will get that database's interactive shell.

When using GNU SQL for a publication please cite:

O. Tange (2011): GNU SQL - A Command Line Tool for Accessing Different
Databases Using DBURLs, ;login: The USENIX Magazine, April 2011:29-32.

= About GNU Niceload =

GNU niceload slows down a program when the computer load average (or
other system activity) is above a certain limit. When the limit is
reached the program will be suspended for some time. If the limit is a
soft limit the program will be allowed to run for short amounts of
time before being suspended again. If the limit is a hard limit the
program will only be allowed to run when the system is below the

reply via email to

[Prev in Thread] Current Thread [Next in Thread]