Getting anything out of a subshell that isn't from STDOUT is impossible. So you can't define an array in a subshell and then use it outside the subshell, and you can't return an array (or anything that isn't a string) from a subshell. If you only use subshells and want to use any kind of data structure that isn't a string passed from STDOUT, you have to do it globally. And subshells are slow. So nobody uses subshells.
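For example, a minimal sketch (file names made up):

#!/usr/bin/env bash
# parentheses instead of braces: the function body runs in a subshell
collect() (
    files=(one.txt two.txt)    # array defined inside the subshell
    echo "${files[@]}"         # only this stdout text escapes
)
out=$(collect)
echo "captured via stdout: $out"               # a flat string, not an array
echo "files in parent: ${files[@]:-<unset>}"   # prints "<unset>"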
If you use Bash for programming, you have to stop thinking in terms of the holier-than-thou software engineer, whose ego believes that a superior, "clean" design makes a superior program. You should embrace globals. You should switch between using or not using the enforcement of set variables or program exit status. You should stop using Bashisms and subtle, obscure language features unless you absolutely have to.
Bash is not a "real" programming language, so do not treat it as one. Do not look for hidden features, or try to do things in cute ways that nobody else uses. There is no superior method or hidden knowledge. Just write extremely simple code and understand the quirks of the shell.
This reminds me of the arguments made against writing “real” code in JavaScript in the early days of the web, until Crockford came along and wrote “The Good Parts.” There is no reason to think that a few idioms and curated features couldn't go a long way toward a much better, less hacky paradigm for bash shell scripting.
Bash is much older than JavaScript. If it was going to turn into a real programming language, it would have by now. It hasn't.
Also, JavaScript really is not a very good programming language. We are just stuck with it because it's the only language the browser understands. (Well, until recently: things are going to change with the introduction of wasm).
But for the shell, we're not stuck with any one language. Whatever you want to do can be programmed in your favorite language. You can easily write a python script instead of a bash function.
I've seen some pretty interesting things like the use of Rust for front end development, like yew[1] and Seed[2].
There aren't many languages that are practical for WASM output. Scripting and managed languages need to ship their interpreters and runtimes with their WASM blobs, and can end up relatively large. JavaScript's interpreter and runtime are baked into every browser already.
That leaves only compiled and unmanaged languages as potentially good WASM targets. As mentioned before, Rust is seeing a lot of development in that space. If LLVM can compile it, then Emscripten can output it to WASM.
Wasm isn't really going to change the front-end / browser side of javascript for the better, but it's great for graphics and anything else cpu intensive.
WASM DOM manipulation will probably come soon, and that will place other languages in the exact same playing field as Javascript. For now, manipulating the DOM still requires calling Javascript.
Actually it turns out that having an interpreted runtime causes an endless stream of problems. We keep trying to patch them over but new problems keep coming up.
It turns out that a standardized low level byte code is the right way to ship cross platform applications. We already knew that. It just took a long time to go through the standardization process and be implemented by browsers.
If it seems like it's not in use today, it's largely because JavaScript has momentum and the majority of web programmers don't yet know how to take advantage of wasm (or maybe don't think they need it).
> It turns out that a standardized low level byte code is the right way to ship cross platform applications
So why hasn’t it taken over the world with Java and now with wasm?
> If it seems like it's not in use today, it's largely because JavaScript has momentum and the majority of web programmers don't yet know how to take advantage of wasm (or maybe don't think they need it).
There’s another possible reason. I’m sure you can come up with the answer yourself.
Javascript never really had a chance to "take over the world" before Google released Chrome and the V8 engine. Before that it was just too slow.
Java applets really sucked. They took a long time to load and initialize. They all looked ugly (probably because of the default libraries?).
Also Java itself as a language was really bad and the development experience was awful. I want to say that no one wants to program in Java but in reality many people do (I don't understand those people).
The important lesson here is the implementation is more important than the idea.
Good idea with bad implementation -> goes nowhere.
wasm is not java.
The important quality about wasm is that it's not garbage collected. It's pretty close to just good old assembly.
>Javascript never really had a chance to "take over the world" before Google released Chrome and the V8 engine.
Chrome was released at the end of 2008, JavaScript was thriving way before that. We had Gmail since 2004, jQuery since 2006. WebApps were the “sweet solution” considered for iPhone apps at first in 2007. Chrome exists because of the healthy ecosystem that Firefox and Safari provided, not the other way around.
>It's pretty close to just good old assembly.
Precisely, and it's yet to be proven that's the best solution for the Web. I love it from the computer science perspective, but historically that idea hasn't struck a chord.
> JavaScript really is not a very good programming language.
Since 2015, JavaScript really is a nice language. I never thought that I'd say that.
Go look at ES6 (the javascript version that came out in 2015) and even TypeScript. You'll be pleasantly surprised, no matter if you're coming from C++, Java, C# or even other "scripting" languages such as PHP and Python.
I use typescript almost daily. It's certainly better than Python or Ruby, I'll grant you that. But I don't think Python is a good language either.
Ultimately I want a language with value types like structs and arrays that you can use to do computations without allocating things on the heap.
In Go I can write a function that returns two numbers.
In Javascript I can only return two numbers by allocating an array on the heap with two items. These two items themselves are probably pointers to two number objects allocated separately. So that's 5 allocations to return two numbers. It's insane.
This means fundamentally it's impossible to create applications in javascript that are both sophisticated and high performance.
A great example to illustrate this is the tooling around Javascript itself.
There were all kinds of bundlers: webpack, rollup, parcel, etc. Rollup was considered the fastest and lightest. They were all programmed in Javascript itself. They were all slow. But no one really knew how bad it was because there was nothing really better.
Then esbuild comes along, and blows all of them completely out of the water, outperforming them by 100 times. And it's written in Go. A language that supports structs and arrays as value types.
It’s not fundamentally impossible to build fast apps in JS. It’s just not possible to do if you don’t do the legwork of writing code that the compiler will optimize for you. It’s not easy, and it’s not natural in a lot of cases, but saying it is fundamentally impossible goes too far: people build high performance applications like games in JS.
For example, you are worried about value types but under the hood the JIT compiler will actually generate efficient representations that are passed by value if you do not mess things up for it. Modern JS compilers are extremely sophisticated.
For returning values without an allocation, you’d create such a “struct” and then use a memory pool. It would still be heap allocated I believe but not totally sure. People who know more than I do could probably tell you a better method to return multiple values from a function efficiently.
> people build high performance applications like games in JS
Yes, and in my experience this kind of application (and the browser in general) is one of the only things these days that still pushes me to upgrade my hardware to have better CPUs and more RAM, as it's perfectly sufficient as is for pretty much everything else I do.
You can’t tell from the outside if you are running a well engineered JS app that would have been fast in another platform, or a non-optimized JS app. My point was that it’s not fundamentally impossible to write fast enough JS code for a game, even though you may come across many slow JS apps regularly. (Not surprising, given the wide distribution and low barrier to entry of the web.)
Yes, you can operate strictly on typed arrays and essentially implement something akin to a virtual machine inside your JS code to make it handle memory efficiently, so you're right - it's not "fundamentally impossible", but are you still really writing JavaScript at that point? :P
There's no distinct "modern JavaScript", it's still the same language. It just has a bunch of things added to make it more manageable, but it still contains all the footguns and gotchas it did before.
Its tooling got significantly better in the last decade or so, but you still need to intentionally rely on it in order to work with the language in any sensible way. You can't really hope for any other outcome when maintaining backwards compatibility.
The tooling only got better last year with esbuild.
Everything else is either unmanageably complicated (webpack) or slow (rollup, parcel, babel).
React mainstreamed the idea of using a virtual DOM to accelerate UI building (accelerate in the computational sense: by not dealing directly with the slow DOM).
> There is no reason to think that a few idioms and curated features couldn't go a long way toward a much better, less hacky paradigm for bash shell scripting.
A couple of years ago, I challenged myself to write some complex scripts and apps using only Bash, and came to this conclusion myself.
You can get pretty far with just Bash alone, especially if you strive to write readable and maintainable code, and not just one off scripts. If you need access to data structures, especially nested ones, you can shell out to another language and then print the results back to stdout so Bash can use them:
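For instance, something along these lines (just a sketch; jq and servers.json stand in for whatever tool and input you actually have):

#!/usr/bin/env bash
# Let jq deal with the nested JSON; hand Bash a flat, line-oriented result.
while IFS=$'\t' read -r name port; do
    echo "checking $name on port $port"
done < <(jq -r '.servers[] | [.name, .port] | @tsv' servers.json)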
Because the shell is used to do things that are easy in the shell. Examples: file manipulation, sorting, field based processing, pipes. Doing these things in a traditional language is more complex and not necessarily better.
Except the shell is bad for those things, except pipes I guess.
I just hate doing things with files in the shell, all those stupid escaping rules, and god forbid you have files with spaces or quotes in the names, or leading or trailing spaces.
It's just spending time learning hacks to get around shell limitations instead of actually getting things done, and at the end it's an ugly mess.
Or maybe I'm just bad at it, but I don't think it's only that.
That's the other part of the problem. The shell is available everywhere. Python needs to be installed. So, if you need shell-related functionality, it is the easiest route to use.
Windows, possibly. And as for the *nixes, including Macs, what version is installed? The way Python 2 eol was extended and extended and extended just hurt the whole ecosystem.
Windows doesn't have bash preinstalled so I'm not sure why you're bringing that up.
All the OSes I've used in the past 3-4 years have had Python 3 by default. The specific version is largely irrelevant, as I could ask you the same question about which version of bash is running on a given OS (e.g. on a Mac the default shell is now zsh and the preinstalled bash is stuck at 3.2 due to GPLv3).
I did a similar thing a couple of months ago, writing a static site generator for my blog in sh. You can write bigger programs as long as you keep discipline, though it is a pain.
> Getting anything out of a subshell that isn't from STDOUT is impossible.
You can use I/O redirection with arbitrary file descriptor numbers, and you can let the shell pick them. So you can `mknod` a pipe, `exec {pipefd_rw}<>"$pipe_name" {pipefd_ro}<"$pipe_name" {pipefd_wo}>"$pipe_name";` and now you have three open file descriptors that you can let the sub-shell use to communicate with the parent. And so you're not limited to stdin/stdout. I do wish there was a built-in for making an anonymous pipe instead of having to make one on the filesystem.
(You have to open the named pipe for read-write first because otherwise you'll block, but if you don't want a read-write FD for it you can close it after opening the read and write FDs.)
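A rough sketch of that pattern, boiled down to one read-write descriptor (error handling omitted, names made up):

#!/usr/bin/env bash
pipe=$(mktemp -u) && mkfifo "$pipe"
exec {fd}<>"$pipe"                # open read-write first so neither end blocks
(
    # inside the subshell: report a result somewhere other than stdout
    echo "result-from-subshell" >&"$fd"
)
read -r result <&"$fd"
echo "parent got: $result"
exec {fd}>&-                      # close the descriptor again
rm -f "$pipe"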
I'll count myself among the nobodies, I guess. I use them a fair bit when part of said subshell cd's somewhere and I don't feel like using pushd/popd, either of which might fail to operate based on my mistakes, but subshells seem to never fail to exit, eventually.
If you take the time and effort to understand how bash wants you to think, you can learn how to write elegant scripts that are as maintainable as anything else.
I've written a lot in bash over the years, and I feel like I understand it pretty well. But I would never say that even elegant bash scripts are as maintainable as "anything else". It is a clunky programming environment, born of compromises, with many traps that are easy to miss in code review.
* already packaged for your distro or package manager
* supported as an integrated linter in major editors
* available in CodeClimate, Codacy and CodeFactor to auto-check your GitHub repo
* written in Haskell, if you're into that sort of thing."
Sometimes you are not at your computer, the script does not have private information (e.g. open source, or something you don't mind being public). Sometimes the website is simply more convenient.
A random web form can exfiltrate the data you paste into it (and whatever your browser lets it gather). A local program can exfiltrate ... approximately everything of value on the machine?
That's why Linux users typically install things from their distro's package manager. The bar to get malicious software in there is very very high (though it is not impossible).
But if you're still using Windows, then yes, I agree.
While I understand the sentiment, I'm not sure how bash could ever be as maintainable as something written in e.g. Python (or even better, a strongly-typed language).
The thing with bash is, it's great for tying things together and quick bits and pieces, but it's not set up for writing maintainable code. Arrays, functions, even if-statement comparisons can all be done in bash (as first-class features), but they're just... easier in other languages. And then think about the refactoring, linting and testing tools available for bash vs other languages. And then on top of that, there's the issue of handling non-zero return codes from programs you call: do you `set -e` and exit on any non-zero return code even when you wanted to continue, or skip `set -e` and ignore errors as your script just continues?
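For example, a sketch of that trade-off (app.log is a placeholder):

#!/usr/bin/env bash
set -e
# grep exits 1 when it finds nothing; under set -e that would abort the script,
# even though "no matches" may be a perfectly normal outcome here...
count=$(grep -c 'ERROR' app.log || true)   # ...so you end up sprinkling "|| true" around
echo "errors: $count"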
Personally, when I feel I want to use a function (or array, or other similar, non-trivial thing), in bash, it's time to reach for another language.
Having said that, there are some nice programs written in bash. https://www.passwordstore.org/ being one that comes to mind.
You are kind of right, I don't write much bash, but I do write some simple scripts that I can call quickly and easily (e.g. start this program with these args, write the log file here with the filename as the current date, etc). Although regarding "Then you must not be writing any bash at all"; I'm not sure how you could have deduced this!
With regards to `print_usage()` and `die()`, yes, I would reach for Python 3 then. The `argparse` module and `raise` are first-class members of the stdlib/language and are better and more standard between programs than if I threw these together myself (and with exceptions you get a stack trace, which is nice).
This is out of necessity. I'm not the sharpest tool in the shed, so I have to go out of my way to write things such that when I come back to them in months or years, I still understand what they do.
For this same reason, I also never use shorthand flags for scripts.
I have no clue what, e.g., "-s -y -o" might do (or even worse, "-syo", dear god my eyes! Also -- is that one special flag, or a series of individual flags?!).
But "--silent --assume-yes --output-file" is pretty easy to grok immediately.
I would add a variable ENDPOINT, initialised to "" and set to " --endpoint $THEENDPOINTVALUE" if the endpoint value was passed. Then include that in every invocation?
Sorry if I missed something in the logic, reading on my phone, but from the comment, this feels like something I do frequently...
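Roughly this kind of thing, I think, though I'd use an array rather than a plain string so the quoting survives (names made up, some_tool is hypothetical):

#!/usr/bin/env bash
endpoint_args=()                                  # empty unless an endpoint was passed in
if [[ -n "${THEENDPOINTVALUE:-}" ]]; then
    endpoint_args=(--endpoint "$THEENDPOINTVALUE")
fi
# then include it in every invocation:
some_tool sync "${endpoint_args[@]}"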
I write a lot of bash, and can kinda agree that it’s full of footguns. Most of them are not even in bash, but in Unix tools, which you have to use due to the lack of standard libraries.
To have variable typos result in errors with `set -u`, then expand an array that is defined but may be empty, I need to write it as `${ARRAY[@]+"${ARRAY[@]}"}`. (Further explanation: https://stackoverflow.com/questions/7577052/bash-empty-array...) But maybe bash arrays are too new a feature, and should be avoided, so let's look at something simpler, like command-line arguments.
Parsing command-line arguments is easy, but parsing them correctly is ridiculously hard. `getopts` doesn't handle long flags, so it's out. `getopt` doesn't handle arguments with spaces in them, unless you have a version with non-standard extensions, so it's out. You're left with manual parsing, and it's a royal pain to make sure you handle every expected case (short/long flags, clusters of short flags, trailing arguments to be passed through to a subprocess, short/long options whose value is in the next argument, long options whose value is after an equals sign, and several others I'm probably forgetting about). And this is just to get the arguments into your script, before you actually do anything with it.
I agree that bash has effective idioms, and learning those idioms can make scripts easier to write. I strongly disagree that bash is "as maintainable as anything else", and scripts beyond a few hundred lines should be rewritten before they can continue growing.
Bash `getopts` handles short and long arguments gnu-style without any problem. The following code handles args like "-h", "-i $input-file", "-i$input-file", "--in=$input-file", and "--help":
while getopts :i:h-: option
do case $option in
# accept -i $input-file, -i$input-file
i ) input_file=$OPTARG;;
h ) print_help;;
- ) case $OPTARG in
help ) print_help;;
help=* ) echo "Option has unexpected argument: ${OPTARG%%=*}" >&2; exit 1;;
in=* ) input_file=${OPTARG##*=};;
in ) echo "Option missing argument: $OPTARG" >&2; exit 1;;
* ) echo "Bad option $OPTARG" >&2; exit 1;;
esac;;
'?' ) echo "Unknown option $OPTARG" >&2; exit 1;;
: ) echo "Option missing argument: $OPTARG" >&2; exit 1;;
* ) echo "Bad state in getopts" >&2; exit 1;;
esac
done
shift $((OPTIND-1))
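So, for example, `./script.sh -i data.txt`, `./script.sh -idata.txt` and `./script.sh --in=data.txt` all end up setting `input_file=data.txt` (script.sh being whatever file the loop above lives in).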
Thank you, I haven't heard of that one before. However, all the examples look like it gets called as an external library, which would either need to be distributed with a script, or to already be installed by a user.
Is there a feature I'm not seeing that would generate an argument parser without external dependencies?
Yes, the arguments::generate_parser function. Install bash-modules, then run `. import.sh log arguments`, then call `arguments::generate_parser '--foo)FOO;String'`, and it will generate a parser function, which includes handling of the --foo option in both `--foo VALUE` and `--foo=VALUE` forms:
...
--foo)
FOO="${2:?ERROR: String value is required for \"-f|--foo\" option. See --help for details.}"
shift 2
;;
--foo=*)
FOO="${1#*=}"
shift 1
;;
...
Because cross-OS (not just cross-linux-distro but even that) standards-making doesn't exist anymore in a real non-broken way, and we're stuck with whatever standards of 20+ years ago. Whether official or de facto (technically bash is just de facto although `bash --posix` or `sh` is an official real standard). There basically isn't any "innovation" happening in this space in a way that could really result in anything as pervasive as bash. Maybe also cause it's just not "interesting" anymore, the 'sexy' things are many levels of abstraction higher than they were when bash came to dominance. It feels like now we're just stuck with it forever, indeed.
Maybe it's time for some other body --- say, freedesktop.org --- to start standardizing things that have traditionally been in the domain of POSIX.
After all, HTML got much better when WHATWG took W3C's toys away and actually started innovating HTML again. Maybe it's time to do something similar with Unix: I'm sick and tired of people telling everyone that we have to stay in some 1995 time capsule because we need to follow The Standard and if The Standard hasn't changed, too bad.
You say "doesn't exist anymore" but when was it better? Pre-Mac-OS-X you didn't even have an easy-to-access shell there on Macs, and Windows wasn't bash either. So if there was a golden era for that, it never included the two predominant desktop OSes.
De-facto standardization has at least brought us free officially-supported options for running Bash or other *nix shells on both of those platforms(if terribly outdated on Mac).
I agree with your broader point that we seem stuck with old shells and no contender seems in a position to replace them. However some are certainly trying, like osh [1], which takes backwards compatibility with bash seriously, making adoption easier.
My feeling is that it is about convenience when typing at the terminal. Having a unified language for both scripting and entering commands is quite convenient. Also, bare strings while typing at the terminal are very convenient! It would be really annoying if we always had to add quotes for string args when we're typing at the terminal:
$ cat "file.txt"
...and I don't even know what options would look like... maybe:
$ ls "*.txt", ["l", "a"]
I dunno... this would mean we'd probably need parens now for precedence-type concerns:
$ ls("*.txt", ["l", "a"]) > "file.txt"
Anyway, that is why I understood that it's this way. I can't point to a resource about it, though.
Powershell is how I instrument cross-platform CI scripting; with the power of the module concept and manifests, you can now write really powerful, type-safe APIs that can run anywhere. It has its quirks though...
Any gotchas with running the same scripts on Windows and Linux? I've been thinking about switching to PowerShell. Love it on Windows. Never tried it on Linux but I did try it on macOS a few years ago and remember some quirks. Currently a bash (over)user.
One gotcha is that Powershell Core has some different APIs than 'Windows Powershell'; it's best to deploy Core on both systems to unify the API.
Also powershell has no support for automatically handling external shell commands that error, you must manually check $LASTEXITCODE (easily done with a custom invoke api that you use for everything).
Also well, some of the escaping in strings is really odd...
You have to add quotes in bash if the filename contains a space. Variables should also be quoted for the same reason, because their values might contain spaces. "${var}"
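A tiny illustration:

file="my notes.txt"
touch "$file"
ls $file      # unquoted: word-splits into "my" and "notes.txt", both missing
ls "$file"    # quoted: one argument, works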
> It would be really annoying if we always had to add quotes for string args when we're typing at the terminal
With bash (and I suspect with every other popular shell out there), it actually means something different. In your second example, if you don't use quotes, the shell will do the expansions, but if you do, the process will have to do them (and in the case of ls, it cannot).
In some cases it's good to have the shell take care of it, but sometimes (e.g., when using 'find') it's not.
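A small illustration of the difference:

ls *.txt              # the shell expands *.txt into filenames before ls ever runs
ls "*.txt"            # ls receives the literal string *.txt and will almost certainly fail
find . -name "*.txt"  # here the quotes are deliberate: find, not the shell, does the matching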
I was trying to point out that quoting parameters or not could bring a different meaning for a shell (as one could see it with Bash), with the implied consequence that one would have to carry over that nuance somehow to a shell where all parameters would have to be quoted.
The Bourne shell thus adopted ALGOL syntax, distinct from anything else in UNIX.
David Korn brought a pile of C, and wedged it into a max 64k program space (for Xenix), and his upward-compatible Korn shell (ksh88) had many, many features.
There are two very distinct places where Bash is not, that I have found.
Bash is not in busybox. It pretends to be, but what is really there is the Almquist shell, with a bit of syntactic sugar to silence the common complaints.
Bash is also not in Debian's shell (/bin/sh). In that place, there is no tolerance of bashisms.
It is important to know the POSIX shell for these reasons.
If you want to choose a programming language, there are hundreds to choose from. You could write a small program ("script") in Brainfuck. But that would be really annoying, and it would take you a long time to get anything done. Other languages may also take a long time to write, test, execute, and maintain. And they may be better at some things than others.
You have to step back and remember the point of all this. Why are you programming? To solve a problem, and make a human's life easier. What is the problem you are trying to solve? How can we make the human's life easier? In this case, it's "I want to combine bunch of programs together in a command-line interface, to make it easier to use these programs to get my work done on the command-line." So, what solution should we choose for this scenario?
Lots of different "scripting" languages exist (we used to call them "glue languages"). Today Python is the most popular, and before it was PHP and Perl. But who cares if they're popular? What are they good for? Why would you choose one over the other?
Python is a general purpose language which is easy to learn, easy to write, and easy to read. Well that sounds nice, but it's very "general" and not specific to our use case. PHP is a language designed for use in web development. Perl is a language designed for system administration-type tasks.
Bash scripting is designed for making it easy to combine already-existing programs in a command-line shell, with no modification or creation of programs required. The only "dependency" is the Bash interpreter (which happens to also be the command-line interface we started our problem with! how convenient!) and an existing collection of software tools that work with each other through a command-line interface. Every part of it is explicitly not designed to "make programming easier". Instead, it is designed to make "combining programs in the command-line" easier. The result is things like every single word you type is potentially just an outside program being executed, or arguments to that program. No other language has this quirky behavior, because they're not designed solely to make command-lines easier to use.
I know Perl well, and the language is fantastically useful as a system scripting language. It combines some of the best features of common shell scripting programs with more programming language-specific features. It is designed with shortcuts and "magic" to make quick work of command-line scripting tasks. But ultimately Perl was not built into the command-line, so its utility is always a bit stilted. Going between the shell and the Perl code and back to the shell isn't as flexible as if the Perl code was embedded in the command-line. Perl also doesn't have as simple a facility for pipes as a command-line shell does. Oh, and a Perl install is rather large, isn't available everywhere by default (anymore), and has all the usual dependency problems. So even though I know Perl very well, if I need to just combine existing programs in the command-line, I always use Shell scripting instead of Perl. (There are Perl shells which solve this problem, but then everyone needs to learn Perl, and Perl can be.... idiosyncratic)
You could also use Awk for a large number of general programming tasks, but again it's designed for a different specific problem: pattern-scanning and processing, not generally combining programs together.
So we use shell scripting to solve the problem because it was designed specifically for our problem, it's ubiquitous, and simple to use. You could use any other programming language, but it wouldn't fit our scenario as well. Once your use case changes, and you are no longer only trying to combine existing programs together, or the existing programs don't do what you want them to do, then you need to use a different language that better fits that new use case.
I tend to start writing things in bash (piping into things like grep, tac, sed, etc).
However if I’m not careful they get painfully complex and I tend to regret not writing it in Perl. As such I now switch to using Perl when I get to the function stage, or anything other than the most simple bash arithmetic.
I don't know why "we" are. Just use Python. Or whatever language you're using for your main system (I've written "scripts" in Scala, sure it takes a second or two to warm up the JVM but it's honestly fine). Or TCL if you really must have a language that can be used as a login shell.
The shell is good enough. When people started to use scripting languages, they just wanted to fill the gap between bash and C; no one wanted to replace the shell, so bash and other shells are still there.
> Getting anything out of a subshell that isn't from STDOUT is impossible.
Yes that is a known limitation that has horrible, horrible workarounds. Don't use them.
That said, for scripts whose main input _is_ from a file or STDIN, and also whose main output _is_ to a file or STDOUT, then bash is far more often the right tool for the job than it is given credit for. Of course, I'm talking about bash and a host of other utilities that are often packaged with it such as awk, grep, sed, cut, etc.
For processing text I find myself often choosing between bash and Python, and very often if I choose bash I'll feel that I've made the right choice.
djb has always used them, even in the shortest of scripts.
Otherwise I agree with everything in this comment.
There is a reasonable argument to avoid using bash for non-interactive scripts. The benefits of any additional bash features, so-called "bashisms", arguably do not outweigh the costs of making these scripts non-portable. One of the many advantages of shell scripts is that they are portable and have tremendous longevity; shell scripts can last a long, long time. There are no version changes and aggressive feature creep to worry about as is routinely the case with programming languages. The scripts just keep working, every day, and we can forget we are even using them, e.g., they are being used in the various ways by UNIX-like OS distributions.
One of today's programmer memes was "Get Stuff Done". Maybe the shell was not meant for today's programmers. But for "sysadmins", or "DevOps", or whatever term anyone comes up with in the future, people who can "administer/operate" computers running a UNIX-like OS for themselves or for someone else like a client or an employer, the Bourne shell works for its intended purpose, better than anything anyone has come up with in the last 50+ years. There's a lot of stuff that "gets done" with the shell, whether it is on someone's own computer, their client's or their employer's.
Attempts at "shell replacements", cf. alternative shells, usually look like interpreters for programming languages, not shells. Perhaps this is not a coincidence.
Bourne shell is relatively small and fast. Computers with small form factors often use UNIX-like OS and when they do they usually include shells. Many programmers seem to dislike the notion that such a layer exists. They often try to blame the shell instead of their own lack of interest in learning to use it.
The shell is boring, and "boring" is sometimes the wisest choice. Most software has expanded to consume available resources (often the developer's choice not the user's). This makes computer performance gains difficult for the end user to discern, e.g., decade after decade, routine tasks seem to take the same amount of time. However the shell and many "standard" UNIX utilities have not changed as much in the same period. Subshells may have been "slow" many years ago. IMHO, they do not feel slow today. Routine tasks performed with the shell seem to run faster, as one would expect after hardware upgrade.
I posit: Life is too short to learn every programming language du jour but it is long enough to learn the Bourne shell, reasonably well.
"Shell" as used here means the Bourne POSIX-like shell, such as NetBSD's Almquist shell, not userland utilities that the shell may or may not call in a script. Scripts can of course test for differences in utilities where there is uncertainty.
If scripts written for a POSIX-like "lowest common denominator" shell were not portable, methinks projects written for such a shell, such as autoconf, would not work on so many different systems. The amount of free software found in open source OS repositories that is built using "configure" shell scripts is not small.
Debian and other Linux distributions use a shell derived from NetBSD's Almquist shell. People who design and maintain these operating systems suggest that, for non-interactive use, this Bourne shell is faster than Bash.
One of the principal things that "got done" using Bash (at least until 2017 -- after that, I couldn't say) off the top of my head was the GFS at NCEP. Yes, it was predominantly Fortran for number crunching; but what tied all the individual programs together was a super massive, maintained, and constantly modified and improved Bash script. And yes, there were plans in the works to switch out for something else (which is why I cannot say, four years later, what the status quo is). I should also mention, it was this Bash script that ran parallelized on the supercomputer -- and it was ultimately this script which, every three hours, produced the next set of forecasts, 24/7.
Well you can use arbitrary file descriptors to get things out of a subshell, not just stdout. I had to do it when I had to get something out of band, while redirecting stdout through a pipeline.
Mind you, if that's possible, it is even uglier than using stdout.
Seems like that should work fine, and it's not even that odd looking or hard to understand. But I haven't tried, not sure if there's some surprise lurking in the depths of tempfiles or something.
You're right about the disadvantages of subshells, but wrong about this:
> avoid bashisms
What's holier-than-thou is insisting that people stick to a dialect of shell scripting that hasn't changed in decades because some people think that conforming to POSIX is its own reward even if it makes life harder and programs worse.
No, thanks. I'll stick to bashisms. Things like coprocesses and mapfile make certain classes of program much easier to write. And yes, shell scripts are programs, and the quickest way to turn these programs into unmaintainable balls of mud is to follow your advice to avoid normal programming best practices simply because the program you're writing happens to be a shell script.
Here is a script that shows exporting results from a subprocess to a parent shell using a temporary file:
#!/usr/bin/env bash
process_line() {
if [[ $* == *foo* ]]; then
echo "found_foo=1" >> "$IPC"
fi
}
IPC=$(mktemp)
find . -maxdepth 1 -type f | while read -r line; do
# note: this loop runs in a subprocess
process_line "$line"
done
source "$IPC"
echo "found_foo=$found_foo"
> Bash is not a "real" programming language, so do not treat it as one.
Bash is definitely a different paradigm to your average imperative or functional language and thus requires a different approach but I wouldn’t go so far as to say it’s not “real”.
It certainly fits the criteria of a programming language even if it does have more warts than a pantomime witch.
I don’t think it makes much sense to judge software by whether it is Turing complete because otherwise that would make Minecraft a programming language
So are you deprecating the term "Turing Complete"? Mighty bold move there. And on that same note, have you seen the stuff being done in Minecraft? It is quite up for debate whether it is a programming language.
> So are you deprecating the term "Turing Complete"?
No. It’s still an interesting yard stick. It just doesn’t describe programming languages in full. For example, and somewhat counterintuitively, it is actually possible to design a programming language that isn’t Turing complete. Some esoteric functional languages do actually fall into that category. But generally programming languages would be a subset of Turing complete software.
> And on that same note, have you seen the stuff being done in Minecraft?
I have, hence why I cited it as an example.
> It is quite up for debate whether it is a programming language.
One could debate anything but it doesn’t mean the argument is made in good faith.
Minecraft is a game. I’m happy to even extend the definition and say it’s supports some visual programming mechanics. But that doesn’t mean it is a programming language.
Just because something has 4 legs and a tail, it doesn’t automatically mean it is a dog.
If anything you are saying it should be deprecated in favour of a real language. If people want to do more advanced things, why does the basic language of their shell not support it? Why should you not care about the correctness of your shell scripts, which can run important things? It seems a perfect place for a functional language. It’s high level, you can just declare what you want. I imagine your interpreter can add in all sorts of checks to help make your scripting more correct.
Powershell attempts to use a more scripting friendly language in the shell.
And it’s a horrible shell experience.
It’s great for scripting (but not as good as other scripting languages) but it’s a terrible experience on a day to day basis, with the only salvation being the existence of 2-3 letter aliases which make things slightly more manageable.
Bash is a great shell language whose scripts are useful because it allows you to trivially convert your manual shell actions into an executable script.
If you need to do anything complex (or rather, something you probably wouldn’t type into the shell manually if you had to do the thing on a one off basis), then you’re probably better off using a different scripting language.
But changing BASH with scripting as a priority would probably make it a worse shell language (which is not to say there are no improvements to be made…there are).
> So you can't define an array in a subshell and then use it outside the subshell, and you can't return an array (or anything that isn't a string) from a subshell
Huh. How does one return a value from a bash function? I've always only ever seen the "echo calling convention" used in scripts, or use of global variables.
A lot of the time you don't even have the luxury of assuming "bash" is present, you've got only "sh".
Since all valid "sh" syntax is valid "bash", you've got to restrict yourself to only sh-valid code in the event the host doesn't have bash.
I have gotten used to doing this -- these scripts usually wind up being run in Docker containers that don't have "bash" installed for the sake of minimalism.
I.e., you can't use [[ $predicate ]], but instead [ $predicate ], etc. Lots of subtle differences.
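For example, one such rewrite might look like this (hypothetical variable):

# bash-only pattern match:
if [[ $answer == y* ]]; then echo "yes"; fi
# POSIX sh equivalent (works in dash, busybox ash, etc.):
case $answer in
    y*) echo "yes" ;;
esac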
Not all sh syntax is valid bash — in bash [[ is reserved but not in sh. And if you want syntax that is valid but doesn’t do the same thing in sh vs bash there’s plenty of examples.
I would agree with this and use python instead, but for uuhh "python problems with strings". IF you are 'blue sky' open, making new things, yes, sure. no BASH. But for things I built 8 years ago that are non-trivial, and I don't prioritize an entire rewrite, BASH works very well. And guess what, every two years that 8-years-ago just comes along with it. BASH is here to stay.
Sometimes we want to write functions that have side effects. Subshell functions don't have side effects. Sometimes you don't want side effects, but sometimes you do, and it's silly to completely foreclose on even the possibility of using side effects.
The article didn't say you should always use subshells for functions, no exceptions. Just that most of the time it probably makes more sense to do so. And subshell functions can still have side effects like interacting with the filesystem, they just can't modify the parent shell's environment.
A “side-effect” in this case is the ability for a function you call to do something “outside” of itself. For example, to change the value of a global variable.
It’s generally considered best practice (in the functional programming community at least) to write “pure” functions (that is, functions without side effects) because it’s much easier to reason about what they are doing.
So good news, subshells can _only_ be pure (the only way to get anything back from them is if they write some string to stdout), but sometimes you do actually want to have some side-effects (imagine a function that reads a config file and then wants to set some of the global variables to the values found there).
Well, bad luck. If you use the subshell syntax you literally can’t do that.
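A sketch of that limitation (config path and variable name made up):

# brace body: runs in the current shell, so it can set globals
load_config()     { source ./app.conf; }      # suppose app.conf contains LOG_LEVEL=debug
# parenthesis body: runs in a subshell, so the assignment dies with it
load_config_sub() ( source ./app.conf )
load_config_sub; echo "${LOG_LEVEL:-unset}"   # prints "unset"
load_config;     echo "${LOG_LEVEL:-unset}"   # prints "debug"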
And what makes you think side effects aren't possible in the model described?
Think of grep as a way to control flow. If one side effect happens, send to stdout one thing. Another side effect, send something else. Then iterate over those outputs.
> Getting anything out of a subshell that isn't from STDOUT is impossible. So you can't define an array in a subshell and then use it outside the subshell, and you can't return an array (or anything that isn't a string) from a subshell. If you only use subshells and want to use any kind of data structure that isn't a string passed from STDOUT, you have to do it globally.
Yes, but that is no problem at all, because that is the way shell scripts work, and if you do it in a functional way, where the function has no state, it is great (with a few exceptions).
> And subshells are slow. So nobody uses subshells.
Shell scripts are slow in general; subshells are no exception, but they also don't have a large impact. And people do use subshells.
> If you use Bash for programming, you have to stop thinking in terms of the holier-than-thou software engineer, whose ego believes that a superior, "clean" design makes a superior program. You should embrace globals. You should switch between using or not using the enforcement of set variables or program exit status. You should stop using Bashisms and subtle, obscure language features unless you absolutely have to.
If you use Shell scripts, you should understand that this language has been designed decades ago and that professionals advise to use it just in short scripts to connect binaries.
> Bash is not a "real" programming language, so do not treat it as one. Do not look for hidden features, or try to do things in cute ways that nobody else uses. There is no superior method or hidden knowledge. Just write extremely simple code and understand the quirks of the shell.
That part is mostly okay, but "real" misses the point, as it is a real language, just one with many problems. However, given its strengths, we haven't managed to replace it yet...
Basically, yes. Most arguments expressed here apply to Bash as well as to other POSIX compatible shells. Even the trick from the initial blog post also works for POSIX shell AFAIK (didn't check). Definitely working is something like this (which is just a little more verbose):
myFunc() { (
#...
) }
So in this case I don't think it makes sense to make a distinction between the two. In essence, I think Bash and the other POSIX shells share most of their greatest strengths and weaknesses.
My understanding was that the initial argument was about Bash as a programming language, and most aspects discussed were ones where Bash is no different than POSIX:
1. Return values of subshells
2. Clean design
3. being a "real" programming language
The only Bash specific part was
> You should stop using Bashisms and subtle, obscure language features unless you absolutely have to.
But in fact, the usage of subshells to limit the scope and subshell return values have nothing to do with Bashisms.
> Well, because it simply doesn't work for them: returning from a function does not trigger the EXIT signal.
It doesn't trigger EXIT, but it does trigger RETURN. Just trap both:
#!/bin/bash
foo() {
trap "echo 'Cleanup!'" RETURN EXIT
#return
#exit
echo "Kill me with ^C or \"kill $$\""
while true ; do : ; done
}
foo # should print 'Cleanup!' on SIGTERM,
# returning, or calling exit
Here's an interesting bit of...abuse :) Since Bash is effectively stringly-typed, it can be used as a functional programming language, with pipes similar to function composition.
e.g.: wtfp.sh
#!/usr/bin/env bash
map() {
local fn="$1"
local input
while read -r input; do
"${fn}" "${input}"
done
}
reduce() {
local fn="$1"
local init="$2"
local input
local output="${init}"
while read -r input; do
output="$("${fn}" "${output}" "${input}")"
done
echo "${output}"
}
filter() {
local fn="$1"
local input
while read -r input; do
"${fn}" "${input}" && echo "${input}"
done
}
add() {
echo $(( $1 + $2 ))
}
increment() {
echo $(( $1 + 1 ))
}
square() {
echo $(( $1 * $1 ))
}
even() {
return $(( $1 % 2 ))
}
sum() {
reduce add 0
}
map increment | map square | filter even | sum
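Feeding it numbers on stdin, e.g. `seq 1 10 | ./wtfp.sh`, should print 220 (the sum of the even squares of 2..11).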
This is the pipe mill[1] pattern. As you say, you can use it to mimic function composition from a functional paradigm. It can make for some elegant solutions to dealing with streams of data.
The issue is that pipe mills are very slow.
$ mill() { while read -r line; do echo $line; done }
$ export FIVE_MEGS=$(( 5 * 1024 ** 2 ))
$ time yes | mill | pv -S -s "$FIVE_MEGS" > /dev/null
5.00MiB 0:00:23 [ 221KiB/s] [============>] 100%
real 0m23.084s
user 0m14.121s
sys 0m26.780s
$ time yes | pv -S -s "$FIVE_MEGS" > /dev/null
5.00MiB 0:00:00 [6.55GiB/s] [============>] 100%
real 0m0.005s
user 0m0.000s
sys 0m0.006s
Even Python loops are faster.
$ export PYLOOP="from sys import stdin
for line in stdin:
print(line)"
$ time yes | python3 -c "$PYLOOP" | pv -S -s "$FIVE_MEGS" > /dev/null
5.00MiB 0:00:00 [67.1MiB/s] [============>] 100%
real 0m0.082s
user 0m0.071s
sys 0m0.019s
Fun shell function fact: used to be if you `break` or `continue` in a function without a loop, bash would find the loop:
breaker_breaker() { break; }
foo() { breaker_breaker; }
while true; do
echo Loop
foo
done
bash dynamically crawled up the call stack until it hit a break-able loop. If you squint it almost looks like exception handling! Anyways this no longer works in bash, though it still does in zsh.
Well, bash's new behaviour (noisily complain and do nothing, reporting success) doesn't do anything useful, so implementing it gratuitously breaks old code to no gain.
Dynamic stack-crawling is more or less the most popular historical behaviour (though afaict the Bourne/Korn lineage just silently fails breaks outside the enclosing function); it's just generally consistent with everything else in the shell that's globally/dynamically scoped. Even in the shells where you can't break out of your enclosing function, `eval break` still breaks through loops in your current context, and that kind of looks like a function you called that called break if you squint.
(POSIX expressly leaves this case undefined with a carveout for break/continue inside the body of a function lexically contained within a loop.)
That's more pass-by-reference, right? Which can be used to set things in the calling context, to be sure, but seems meaningfully different from "run this code N frames up", partly because it is limited in what it can do and partly because it can only change variables that you are actually mentioning in what you pass in.
I usually associate "call-by-name" with laziness. Isn't this more pass-by-reference that may happen to be implemented using names under the hood? Alternatively, how would you distinguish them?
I'd argue that most of the work in a bash program is done by external programs like find, grep, etc., and that the time to fork is not all that relevant. We don't program the same kinds of things in bash that we might in C++.
Not quite. WSL2 uses a Linux kernel. WSL1 uses a Windows kernel and fork is much slower there. Also there's userspace variants like MSYS2, Cygwin, etc.
Assuming that fork is fast everywhere is how you end up with things like ffmpeg's configure script that runs in seconds on linux and _minutes_ on Windows.
if you want to fork your function call then you do it explicitly with $(my_function). I'm aware that people are always discovering things for the first time but there is literally decades of thought that has gone into why bash behaves the way it does. and there's a pretty good reason why the bash authors decided not to make function calls fork by default...
Sure it's slow on startup, but if you design your functions to pipe to each other as if you were writing purely-functional code, startup time is mostly irrelevant. Function startup happens once and from there they just feed down to the parallel-by-default pipe stream.
Everything is relative, a $(true) is on the order of 0.1 milliseconds. If this makes something ridiculously slow, you may be implementing a too big part of your work in bash!
You can also turn a Bash function into a script command with almost no effort, such as making a file called "run" and putting this in it:
#!/usr/bin/env bash
set -eo pipefail
function hey {
echo "Hey!"
}
TIMEFORMAT=$'\nTask completed in %3lR'
time "${@}"
Now after running `chmod +x run` you can run it with: ./run hey
Feel free to replace "time" with "eval" too if you don't want your command timed.
This is a really useful pattern because it means you can create a "run" script with a bunch of sub-commands (private or public), auto-render help menus and create project specific scripts without any boilerplate. Bash also supports having function names with ":" in the name so you can namespace your commands like "./run lint:docker" or "./run lint:frontend".
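For instance, a help menu can be sketched with compgen inside the same "run" file (assuming the convention that "private" helpers start with an underscore):

function help {
    printf "%s <task> [args]\n\nTasks:\n" "${0}"
    compgen -A function | grep -v '^_' | cat -n
}

Then `./run help` lists every function as an available task.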
One of my biggest pet peeves is people defining commands and utilities as shell functions and demanding I source some environment setup script instead of just making regular command scripts and running them as normal programs.
My shell is my computing environment. It's rude for a script author to make me change my computing environment when he could just as easily have made a script and created his own sandboxed environment that wouldn't interfere with mine.
People should never ship end user interfaces as bundles of shell functions that have to be sourced.
I remember when I first interacted with Ruby Gems and learned I needed to source some file to make it work, I thought, "Wow, Ruby is so weird and magical they can't even use the shell normally." The joke was on me when Python virtual envs became a thing.
If you don't want the timing, use `"${@}"`, not `eval "${@}"`. The latter will parse all of your arguments through the shell an additional, unwanted time.
For example, take the command `./run echo "Don't"`. When using `time`, this will print `Don't`. If you use just `"${@}"`, it will also print `Don't`.
But if you use `eval "$@"`, this will crash with an error like
eval: line 10: unexpected EOF while looking for matching `''
One thing I like about make is that you get an automated bash completion for targets, which I find helpful. Is there an equivalent sneakiness for doing this with this style of script?
If you check the first link in my previous reply it does include an auto-generated help menu that uses compgen to print a list of functions near the bottom. This way you can run `./run` or `./run help` to get a list of commands. I find this is helpful enough without needing completion, especially since every function ends up being a top level command.
People do underestimate the shell. Particularly I see people shoehorning collections of commands into a Makefile, when a shell script would work just fine.
On the other hand, with a lot of glue work I do, I eventually want to do something more complex (use lists and maps, complex string building and regexes, date handling) and while you /can/ do that in bash, I might as well start in Python and have everything in one language and take advantage of things like code sharing via modules. (And yes you can share code in shell, but again it’s not as nice.)
Might as well, yes, but I've found writing shell scripts in Python to be cumbersome because whatever flavor of os.system() I end up using just doesn't work well syntactically. I can run a command and pipe to a bunch more commands way easier in a shell because I'm already using a shell when interacting with the computer. Perl had this figured out, but proved unable to continue evolving (aka adding types, like Python/Ruby/JavaScript have managed to.)
If there's a modern library/workflow that makes this not the case, I'm all ears!
> writing shell scripts in Python to be cumbersome because whatever flavor of os.system() I end up using just doesn't work well syntactically.
This is exactly how I felt. Bash can't handle structured data well. Python (being general purpose programming language) can't handle calling external programs well because it's not the "focus" of the language. My shameless plug solution is a "real" programming language that can do both well (along with proper handling of exit codes and more goodies for "devops"y scripting).
> Bash can't handle structured data well. Python (being general purpose programming language) can't handle calling external programs well because it's not the "focus" of the language.
Perl and Ruby do both very well. Or at least they do the latter in a simpler way than Python, and they're no worse than it for the former.
NGS is programming-language-first while other modern shells are typically shell-first. Multiple dispatch would probably be the most prominent manifestation of this approach in NGS.
> NGS is programming-language-first while other modern shells are typically shell-first.
Not always no.
Even if your point were true, being programmer-first is not always a desirable feature. The vast majority of shell work is basic and repetitive. Most of the time people just want something that functions a little like Bash but less shitty for scripting. The fact that neither Powershell, LISP, nor Python has taken over the world for shell usage proves that a higher level language REPL generally makes for a shit daily shell. So usually you end up with just a small few purists using it. And that’s not really good enough.
Whereas Oil, Elvish, Murex and even Fish (to a less dramatic extent) are looking at what makes a good shell and then fixing shell scripting within that shell. Having gone through the REPL phase myself with a great many different languages and found them all painful for daily use, I’m inclined to agree with shell-first approach.
> Multiple dispatch would probably be the most prominent manifestation of this approach in NGS.
Again, NGS isn’t unique in that regard. Murex has methods, Powershell has methods. I’ve not seen anything new in NGSs “Multiple Dispatch” nor “MultiMethod” docs that other alt shells aren’t also doing.
Don’t take this the wrong way, it looks very impressive what you’re doing. But it’s not unique these days. And I say this having tried a great many options out there.
> being programmer-first is not always a desirable feature.
programming-language-first allows convenient scripting. I could not think of a way to make anything shell-first be convenient for scripting (anything beyond tiny scale).
> higher level language REPL generally makes for a shit daily shell.
I think it proves that general purpose languages, where using them as a shell was afterthought "makes for a shit daily shell".
NGS, on the other hand, has somewhat-bash-like (read: easily run external programs) syntax at the top level, which should be good for CLI. Want to use more advanced features that are typically associated with "real" programming languages? OK, pay the price, switch syntax with { ... } and have full blown programming language at your disposal, in the shell.
> NGS isn’t unique in that regard.
I do scan alternative shells from time to time. I could have missed but I didn't see multiple dispatch in any of them. Which ones have it? Just to clarify: In which shell you can define several methods with the same name and when called, the method to invoke is selected based on the types of the arguments?
There is a huge difference in having methods and multiple dispatch.
> not unique these days
NGS is a mixture of "borrowed" features and unique ones. Multiple dispatch is an old concept and has been implemented in other programming languages. Examples of things in NGS that I have not seen anywhere else: syntax for run-command-and-parse-output, proper handling of exit codes.
Regarding exit codes. Typical approach to exit codes varies. Python - "I don't care, the programmer should handle it". Some Python libraries and other places - "Non-zero is an error". That's simplistic and does not reflect the reality in which some utilities return 1 for "false". bash (and probably other shells too) is unable to handle in a straightforward manner a situation where an external command can return exit codes for "true", "false" and "error". It just doesn't fit in the "if" with two branches. NGS does handle it with "if" with two branches + possible exception thrown for "error" exit code.
Edit: and some other features around handling exit codes such as short syntax to provide expected exit code (any other exit code throws exception)
Edit: clarification - NGS knows that some external programs have exit code 1 which does not signify an error.
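To make the bash side concrete: grep, for example, exits 0 for "found", 1 for "not found" and 2 (or more) for a real error, so in plain bash you end up writing something like this instead of a simple "if":
    grep -q "$pattern" "$file"
    case $? in
        0) echo "found" ;;
        1) echo "not found" ;;                   # "false", not an error
        *) echo "grep itself failed" >&2; exit 1 ;;
    esac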
> I do scan alternative shells from time to time. I could have missed but I didn't see multiple dispatch in any of them. Which ones have it? Just to clarify: In which shell you can define several methods with the same name and when called, the method to invoke is selected based on the types of the arguments?
Murex does this but in a slightly different way. Rather than method overloading (which is a bad feature in any language in my opinion, but I get it has its fans), murex has APIs that allow for writing data type agnostic methods. In practice that means the same tools to query the length of an array, or grab an item in a map (etc) work irrespective of whether the data type is a JSON array or map, YAML, CSV, S-Expressions or even just ‘ps’ (etc) output… and so on and so forth. This expands out to all methods, meaning you basically have a ‘jq’ like toolset that works exactly the same - same commands etc - for most data formats (certainly the ones I use daily). Which in effect is the same end result as what you’re describing, except without the method overloading, instead allowing me as a shell script writer to worry less about the data type or format of the piped data.
There are a few specific methods that do allow for overloading, but that’s where functionality is a little more complex (these are usually event handlers; I don’t use them much so I can’t say how well they work).
> NGS does handle it with "if" with two branches + possible exception thrown for "error" exit code.
Murex does this too. I’m pretty sure I’ve seen other shells do that as well (possibly Elvish?)
Ironically it was actually the error handling that drew me to murex and the smart handling of data types within methods that kept me there. Basically all the same features you’re promoting in NGS. Which is why I’m saying there is a lot out there that does the same.
Anyhow, it’s been interesting reading about your work on NGS. Good luck
murex author here. Hopefully I can answer some questions:
> I do scan alternative shells from time to time. I could have missed but I didn't see multiple dispatch in any of them. Which ones have it? Just to clarify: In which shell you can define several methods with the same name and when called, the method to invoke is selected based on the types of the arguments?
I've not heard of the term "multiple dispatch" but reading the thread it sounds like you're describing function overloading. Powershell does support this with classes[0].
murex does its overloading at the API level[1]. The reason behind that decision is to keep the methods simple (eg a pipeline might contain JSON or CSV data): you as a shell user don't want to run into situations where you've written a function that supports one data type but not another; you just want it to work first time. So murex automatically abstracts that part away for you. In addition to the point the other poster mentioned about consistent `jq`-like methods that are data type agnostic, murex allows for easy iteration through objects agnostic of their data type (eg `open somedata -> foreach { do stuff }` -- where you don't need to think about data types, file formats, etc, murex does the heavy lifting for you in a consistent and predictable way).
There are some specific builtins that support "overloading" of sorts but instead of overloading the function they have handlers that call user defined functions[2][3]. This removes the mystery behind function overloading because you can easily trace which handlers exist for which data types.
It's also worth adding that function overloading is supported in the same way that it's also supported in Bash, where you might have a function but if there is an alias of the same name that will take priority. There are also private[4] functions (which are namespaced) so you have additional controls to avoid accidental overloading when writing modules too. And the interactive shell expands out any command (eg displays a hint text stating if a command is a function, alias, external command, etc) so there's a reduced risk of being unaware if `ls` is an alias, function, `/bin/ls`, or even a symlink to something else.
Named parameters are optional because neither Windows nor POSIX have any understanding of named arguments in their processes. I did consider abstracting that into murex's functions regardless, like Python et al would, but I couldn't design a way that wasn't jarring nor cumbersome to write in a hurry and that worked transparently with Windows and POSIX ARGS[]. So I've come up with an optional builtin called `args`[5] which allows you to define flags, which are effectively the same thing as named parameters except they're supported natively by Windows and POSIX ARGS[]. That way you can write a murex script as a native shell script without leaving the user to write another abstraction layer themselves interpreting indexed arguments into named arguments. Thus giving you the flexibility to write really quick Bash-like functions or more verbose scripting style functions depending on your need.
> Examples of things in NGS that I have not seen anywhere else: syntax for run-command-and-parse-output, proper handling of exit codes.
Both of these are baked into murex as well. The run-command-and-parse-output is the API stuff mentioned above. It's where the overloading happens.
As for error handling: any command is considered a failure if there is either a non-zero exit code, STDERR contains a failed message (eg "false", "failed", etc) or STDERR is > STDOUT[6]. This has covered every use case I've come across.
Additionally STDERR is highlighted red by default and when a process fails you're given a mini-stack trace showing where in the script the error happened. You have try/catch blocks, saner `if` syntax etc that all use the same API for detecting if a process has failed too. Keeping everything consistent.
The other thing murex has to help catch bugs and errors is a testing framework baked into the shell language. The docs for test[7] need expanding but in essence:
- you can write proper unit tests
- you can intercept STDOUT (even if it's mid pipeline) and test that output. This works around instances where (1) you can't have full on unit tests due to side effects in a function that can't be easily mocked (2) you want to add your own debugging routines (rather than just printing values to STDOUT or running commands manually to see their output -- like one would normally have to do with shell scripting)
- you can add state watches. Except in murex the watches are Turing complete so you're not just adding noise to your terminal output but rather putting meaningful debug messages and scripts.
And all of the debugging stuff can be written straight into your normal shell routines and cause no additional execution overhead unless you've purposely enabled test mode.
So it's fair to say I've spent a significant amount of time designing smarter ways of handling failures than your average shell scripting language. As I'm sure you have too.
> Regarding exit codes. Typical approach to exit codes varies. Python - "I don't care, the programmer should handle it". Some Python libraries and other places - "Non-zero is an error". That's simplistic and does not reflect the reality in which some utilities return 1 for "false". bash (and probably other shells too) is unable to handle in a straightforward manner a situation where an external command can return exit codes for "true", "false" and "error". It just doesn't fit in the "if" with two branches. NGS does handle it with "if" with two branches + possible exception thrown for "error" exit code.
If you're describing that as a "two branch" approach then technically murex could be argued as having "three branches" because it checks exit code, STDERR contents, and payload size too. Personally I just describe it as "error handling" because in my view this is how people's expectations are rather than the reality of handling forked executables.
I guess where NGS and murex really differ is NGS likes to expose its smart features whereas in murex they're abstracted away a little (they can still be altered, customised, etc) to keep the daily mundane shell usage as KISS (keep it simple stupid) as possible. eg you can overload function calls if you really wanted but that can often cause unforeseen complications or other unexpected annoyances right when you least want it to. So murex keeps that stuff around for when you need it but finds ways to avoid people needing to rely on it and furthermore unwraps the covers of what a routine does in the interactive terminal. It's all about offering abstractions but removing surprises.
> NGS knows that some external programs have exit code 1 which does not signify an error.
Having different behaviours for different executables hard coded into the shell is one behaviour I purposely avoided. I do completely understand the incentive behind wanting to do this and wouldn't criticise others for doing that but given external programs can change without the shell being aware, they can be overloaded with aliases, functions, etc, and they might even just differ between Linux, BSD, Windows, etc -- well it just seemed like hard coding executable behaviour causes more potential issues than it solves. You also then run into problems where users expect the shell to understand all external executables but there are some you haven't anticipated thus breaking expectation / assumptions. Ultimately it creates a kind of special magic that is unpredictable and outside the control of the shell itself. So instead I've relied on having foundational logic that is consistent. It's the one side of shell programming where I've placed the responsibility onto the developer to get right rather than "automagically" doing what I think they are expecting. This of course does mean I have to focus even harder on ensuring all other aspects of the shell are predictable and low maintenance so that the developer can cover any specific edge cases with ease. Which goes a long way to explaining why I've chosen the path I've chosen (ie reducing the amount of syntax sugar, overloading, etc needed for daily use that one might rely on in a more traditional programming language).
Somewhat similar. Since methods do not live in classes in NGS, I would argue the mechanism in NGS is simpler (more elegant?). You can just define your_method(c1:Class1, c2:Class2).
> murex does its overloading at the API level[1].
mmm. I see the support but looking at documentation at https://murex.rocks/docs/apis/Unmarshal.html , I see it's not exposed into the Murex language, it's in Go. Is this correct?
> where you don't need to think about data types, file formats, etc, murex does the heavy lifting for you in a consistent and predictable way
That is good and what I would expect.
( sorry, running out of time, to be continued :) )
Learn something new every day. Seen (and used) this methodology before but wasn't aware it was called that. I'd always heard of it as "overloading", which is conceptually similar but not exactly the same.
> mmm. I see the support but looking at documentation at https://murex.rocks/docs/apis/Unmarshal.html , I see it's not exposed into the Murex language, it's in Go. Is this correct?
It's APIs written in Go (for performance and convenience -- you wouldn't want to write a YAML marshaller in a shell scripting language). So the methods are exposed as builtins. However you can add methods written in murex if you want.
The idea being get 99.9% right by default but leave some flexibility for the user to customise if they want.
Murex builtins are also written in a way that they're all optional includes into the core project. Thus you can easily write your own builtins, marshallers, etc in Go if you want performance and then call them from your shell script. (Or you could write them in Python, Java, Perl, etc and run them as an external executable, but if you do that you lose access to murex's typed pipelines, which is where the really interesting stuff happens. I haven't yet figured out a non-shitty way to send typed data over POSIX pipes to external executables.)
> sorry, running out of time, to be continued :)
Any time :) I'm finding this conversation really interesting and educational
I should have added (outside of edit time now) that I do like what you've done with NGS. Nothing I've posted above is intended to suggest I disagree with your approach. We're covering mostly the same problems but we've just looked at them from different angles. Which is good -- the more options out there the better I say.
In NGS the common case for multiple dispatch would be to define your own type and then add methods to an existing multimethod to handle that type. Additionally, I've simplified the dispatch algorithm exactly for this reason - to avoid an unclear answer to "what's actually going to be called?".
> function overloading is supported in the same way that it's also supported in Bash, where you might have a function but if there is an alias of the same name that will take priority.
I don't think it is called overloading. That's very different from having two methods with the same name and the "right one" is called based on types of passed arguments.
> interpreting indexed arguments into named arguments.
In NGS it happens in exactly one place, when main() is called. At that point command line arguments are automatically parsed based on main() parameters and passed to main().
I don't think I understand your reasoning behind not including named parameters.
> NGS likes to expose its smart features whereas in murex they're abstracted away a little
Sounds about right. Power to the people!
> Having different behaviours for different executables hard coded into the shell is one behaviour I purposely avoided.
I can see why. NGS prefers to do the right thing in most cases and have shorter and cleaner code. Yes, that adds some risk, which I see as not that big - an exception when an exit code was actually "ok" for an unknown program (unknown external programs default to an exception on non-zero exit code).
I also dream about moving the hard coded information about programs into a separate schema, which could be re-used between different shells. That would be similar to externally provided TypeScript definitions for existing code, described at https://www.typescriptlang.org/docs/handbook/declaration-fil...
> differ between Linux, BSD, Windows,
"detect program variant" feature in the schema I mentioned above
> hard coding executable behaviour
Yep, smells a bit
> causes more potential issues than it solves.
I think in particular situation with exit codes the risk is low.
> users expect the shell to understand all external executables but there are some you haven't anticipated thus breaking expectation / assumptions
Yes. Downside. That's the kind of surprise that sucks. Need to make sure at least the docs clearly warn about this.
> So instead I've relied on having foundational logic that is consistent.
Totally understandable.
> STDERR contains a failed message (eg "false", "failed", etc) or STDERR is > STDOUT[6]. This has covered every use case I've come across.
Sounds also like potentially surprising, somewhat similar to handling exit codes. Yes, it's not per program and that's why it's better but still.
> Additionally STDERR is highlighted red by default
As it should! Yes!
> mini-stack trace
Yes, please!
> testing framework baked into the shell language.
> So it's fair to say I've spent a significant amount of time designing smarter ways of handling failures than your average shell scripting language. As I'm sure you have too.
I guess. Differently of course :)
> Personally I just describe it as "error handling" because in my view this is how people's expectations are rather than the reality of handling forked executables.
Sounds like I was not clear enough and made it seem as if something is not uniform in this regard in NGS. Exceptions are not only for external programs. It just happens that an external program can be called inside an "if" condition; an exception might occur there.
> > function overloading is supported in the same way that it's also supported in Bash, where you might have a function but if there is an alias of the same name that will take priority.
> I don't think it is called overloading. That's very different from having two methods with the same name and the "right one" is called based on types of passed arguments.
Yeah you're right, that's not overloading. I'm not really sure why I included that in my description.
> In NGS it happens in exactly one place, when main() is called. At that point command line arguments are automatically parsed based on main() parameters and passed to main().
> I don't think I understand your reasoning behind not including named parameters.
there isn't really an entry point in murex scripts so
function foobar {
    out foobar
}
foobar
is no different from
#!/usr/bin/env murex
out foobar
and no different from typing the following into the interactive terminal
$ out foobar
Which means every call to a builtin, function or external executable is parsed and executed as if it has been forked, at least with regards to how parameters are just an array (actually on Windows they're not even an array. Parameters are passed just as one long string, whitespace and quotation marks and all. Which is just another example of why Windows is a shit platform). This is a limitation of POSIX (and Windows) that I decided I didn't want to work around because otherwise some parts of the language would behave like POSIX (eg calling external programs, where named parameters are passed as `--key value` style flags) vs scripting functions, where named parameters would be positional arguments.
I wasn't really interested in creating a two tier language where parameters are conceptually different depending on what's being called so I decided to make flags easy instead:
function hippo {
    # Hungry hippo example function demonstrating the `args` builtin
    args: args {
        "Flags": {
            "--name": "str",
            "--hungry": "bool"
        }
    }
    $args[Flags] -> set flags
    if { $flags[--hungry] } then {
        out: "$flags[--name] is hungry"
    } else {
        out: "$flags[--name] is not hungry"
    }
}
That all said. I'm still not 100% happy with this design:
- `args` still contains more boilerplate code than I'm happy with
- murex cannot use `args` to automatically generate an autocomplete suggestion
So I expect this design will change again.
> Sounds also like potentially surprising, somewhat similar to handling exit codes. Yes, it's not per program and that's why it's better but still.
I don't see how that's surprising because it's literally the same thing you were describing with NGS (if I understood you correctly).
> Sounds like I was not clear enough and made it seem as if something is not uniform in this regard in NGS. Exceptions are not only for external programs. It just happens that an external program can be called inside an "if" condition; an exception might occur there.
It was me who was unclear as I assumed you meant it was the same error handling for builtins and functions too (as is also the case with murex). I just meant the "reality of handling forked executables" as an example rather than as a commentary about the scope of your error handling :)
> there isn't really an entry point in murex scripts
I have a nice trick in NGS for that. Under the idea that "small scripts should not suffer", a script runs top to bottom without an "entry point". However, if the script has defined a main() function, it is invoked (with command line arguments passed).
> `args` still contains more boilerplate code than I'm happy with
Is there anything preventing you from having exactly the same functionality but with syntactic sugar so that it looks like a parameter declaration? (Just to be clear, keeping all the ARGV machinery).
Something like (assuming local variables are supported; if not, it could still be $args[Flags] etc):
function hippo(name:str, hungry:bool) {
    if { $hungry } then {
        out: "$name is hungry"
    } else {
        out: "$name is not hungry"
    }
}
> I don't see how that's surprising because it's literally the same thing you were describing with NGS (if I understood you correctly).
Yes it is almost the same in NGS with regards to exit codes, which you preferred not to do in Murex. On the other hand, checking stderr looks very similar to knowing about exit codes and here you decided to go for it. I'm puzzled why. It is somewhat fragile, like knowing about exit codes.
> It was me who was unclear as I assumed you meant it was the same error handling for builtins and functions too (as is also the case with murex). I just meant the "reality of handling forked executables" as an example rather than as a commentary about the scope of your error handling :)
Here I lost you completely but I hope it's fine :)
In NGS, exception handling is used throughout the language, consistently, well at least I hope it is.
> Is there anything preventing you from having exactly the same functionality but with syntactic sugar so that it looks like a parameter declaration? (Just to be clear, keeping all the ARGV machinery).
> Something like (assuming local variables are supported; if not, it could still be $args[Flags] etc):
> example code
oooh I like that idea. Thank you for the suggestion.
> Yes it is almost the same in NGS with regards to exit codes, which you preferred not to do in Murex. On the other hand, checking stderr looks very similar to knowing about exit codes and here you decided to go for it. I'm puzzled why. It is somewhat fragile, like knowing about exit codes.
No, murex still checks for exit codes. Essentially a program is considered successful unless any of the following conditions are met:
+ exit code is > 0
+ STDERR in ('false', 'no', 'off', 'fail', 'failed')
+ or []byte(STDERR) > []byte(STDOUT)
The exit code is self explanatory
STDERR messages are useful for functions that return a message rather than exit code. Particularly inside `if` blocks, eg
= 1 == 2 -> set math
if { $math } then { out: $math } else { err: $math }
will return `false` to STDERR because $math == "false" (ie `= 1 == 2` will return "false")
As for the STDERR > STDOUT test, that covers an edge case where utilities don't return non-zero exit codes but do spam STDERR when there are problems. It's an uncommon edge case but the only time this condition is checked is when a command is evaluated in a way where you wouldn't normally want the STDOUT nor STDERR to be written to the terminal.
This also allows you to get clever and do stuff like
if { which: foobar } else { err: "foobar is not installed" }
where `if` is evaluating the result of `which` and you don't need to do more complex tests like you would in Bash to test the output of `which`.
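(For comparison, the usual Bash version of that check is something along these lines, where you have to remember to silence the output yourself:
    if command -v foobar >/dev/null 2>&1; then
        echo "foobar is installed"
    else
        echo "foobar is not installed" >&2
    fi
)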
As in not throwing exception or as in evaluates to true?
> exit code is > 0
You refused to get into specific exit codes for specific programs citing potentially surprising behavior (which I agree will sometimes be an issue).
> STDERR in ('false', 'no', 'off', 'fail', 'failed')
I correlated this with the following source code:
> if len(s) == 0 || s == "null" || s == "0" || s == "false" || s == "no" || s == "off" || s == "fail" || s == "failed" || s == "disabled" { return false
... which also has a potential for surprising behavior. (side note: appears to work on stdout, not stderr)
> or []byte(STDERR) > []byte(STDOUT)
Does that mean len(stderr) > len(stdout) ?
... also has a potential for surprising behavior. I can easily imagine a program with lots of debug/warnings/log on stderr and truthy/non-error output.
> STDERR messages are useful for functions that return a message rather than exit code.
Might be problematic because typically exit codes are used for that.
> where `if` is evaluating the result of `which` and you don't need to do more complex tests like you would in Bash to test the output of `which`.
Mmm.. Never did this. In which situation is looking just at the exit code of `which` not good enough?
> if { which: foobar } else { err: "foobar is not installed" }
Just to show off :) in NGS that would be:
if not(Program('foobar')) {
    error("foobar is not installed")
}
I use Make extensively for glue scripts or build scripts that call other tools. Make gives you four big advantages over a pure shell-script:
1. Tab completion. All major bash-completion packages know how to show Makefile targets in response to a TAB.
2. Parallel, serial, or single-target execution.
3. Automatic dependency resolution between tasks. Tasks that build files can also use timestamps to see what needs rebuilding.
4. Discoverability. Anybody who sees a Makefile will usually understand that something is supposed to be run from the Makefile's directory. Chances are good that they'll check the tab-completions too. There are conventions for standard targets like 'clean' and 'all'.
If you have a project with a build-process that has a bunch of small tasks that you might sometimes want to run piece-by-piece, Make is the perfect tool IMO.
Since make does not sanitize input or handle errors, I use it only for parallelism/dependency management and offload all build steps to shell scripts. I've found this to be way more maintainable.
Discoverability goes out the window the instant someone uses something like automake unfortunately. Then the makefile becomes an absolute mess of dummy targets and near gibberish.
I write a LOT of bash/shell scripts. And I don't like it, it's just part of what I have to do.
Learning a handful of bash idioms and best-practices has made a massive impact for me, and life much easier. The shell is something you cannot avoid if you're a programmer or other sort of code-wrangler.
You can interact with it + be (mostly) clueless and still get things done, but it's a huge return-on-investment to set up "shellcheck" and look up "bash'isms", etc.
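A typical example of the kind of thing shellcheck catches (SC2086, unquoted expansion):
    target=$1
    rm -rf $target      # shellcheck warns: unquoted, so it word-splits and glob-expands
    rm -rf "$target"    # the quoted form is almost always what you meant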
----
(Off-topic: I am convinced Ruby cannot be beaten for shell-scripting purposes. If I had a wish, it would be that every machine had a tiny Ruby interpreter on it so I could just use Ruby. I'm not even "a Ruby guy", it's just unreasonably good/easy for this sort of thing. And I keep my mind open for better alternatives constantly.)
I'm not sure how much closer to describing your exact intent in English a language can get than:
successfully_made_executable = system 'chmod +x /usr/local/bin/hasura'
abort 'Failed making CLI executable' unless successfully_made_executable
I have NOT written Perl (either old Perl, or Raku/Perl 6), but I do believe it may be roughly this semantic too.
EDIT: Looks like Perl/Raku is essentially the same as Ruby in this regard. So besides it being a whacky language, take that for what you will:
my $successfully_made_executable = shell "chmod +x /usr/local/bin/hasura";
die 'Failed making CLI executable' unless $successfully_made_executable.exitcode == 0;
Oh yeah, bash functions are great and absolutely abusable. Sometimes you need some grand hacks to get it to work well, but when it works well, it can do some magic. You can even export functions over ssh!
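The usual trick is to serialise the function text with `declare -f` and feed it to the remote shell, something like this (function name made up, and assuming the remote login shell is bash):
    greet() { echo "hello from $(hostname)"; }
    ssh some-host "$(declare -f greet); greet"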
I wrote this a few years back which ran on bunches of hosts and fed into an infrastructure network mapper based on each host's open network sockets to other known hosts. It wasn't really feasible to install a set of tools on random hosts, but I still had root ssh access across the board. So I needed something tool agnostic, short, auditable, and effectively guaranteed to work:
Exactly -- check out the script in the link, it shows how it would be used. I'm not sure why the first `remote_cmd` is called (probably local testing and I forgot to delete it), so ignore that.
Try this and you'll see how it returns a dramatic amount of bash as its output:
Ah, makes sense! Yeah, that's nice. You can do something similar with scripting languages by piping source code into the interpreter. Super useful back in the day for doing crap on machines in an LSF cluster when the NFS was down/slow.
I would imagine there would be performance implications of defining every bash function as a subshell, which is why it's not universally recommended to define functions this way?
It probably doesn't matter too much if you have only a handful of function invocations in your exec. But if you have a couple orders of magnitude more... RAM is going to be an issue too maybe.
Creating a new process for every single function invocation seems crazy to me, but might actually be just fine for many "ordinary" use cases? (Although it might not have been on computers of 20+ years ago, which might also be why it's not something advised; so much of bash tradition is decades old.)
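A quick and unscientific way to get a feel for the overhead on your own machine:
    f() { :; }      # ordinary function
    g() ( : )       # same body, but the body runs in a subshell

    time for i in {1..1000}; do f; done
    time for i in {1..1000}; do g; done   # typically far slower: one fork per call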
Of course the subshell is copy-on-write, so the RAM requirements shouldn't be huge. But since each process uses at least some stack, you are looking at 4k per call, which adds up fairly quickly.
If you care about performance, you'd better not write shell scripts. The typical task of a shell script is to start programs and connect them in a meaningful way. This is in itself a pretty expensive task performance-wise.
So arguing about the cost of sub-shells is somewhat beside the point.
Yes, and that's my reaction too. While I can see the rationale for always starting another process, in practice I haven't found the leakage to be a big problem.
Actually Oil functions don't use dynamic scope, but this is done in-process, not with another process:
Also, nested functions don't really add anything useful to shell. It's purely a matter of textual code organization and doesn't affect the semantics. I define all functions at the top level.
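For example, in bash a "nested" definition only takes effect when the outer function runs, and then it's just as global as any other function:
    outer() {
        inner() { echo "hi from inner"; }
    }

    outer
    inner   # works; the nesting changed nothing about scope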
If your hot path needs that level of micro-optimisation then you're far better off rewriting it in a language that compiles instead of one that interprets and forks. Even Ruby and Perl would run circles around Bash.
> Subshells are, as the name suggests, running in a subshell. They don't strictly have to be OS subprocesses
Are there really cases where subshells are invoked within the same process? In my experience, it has never been the case. That's why I've been trying to minimize the use of subshells because spawning a new process is a bit slow.
By creating a comma-delimited list of command-line arguments, the parsing logic and control flow can be influenced from a single location in the code base. The trick is to eval-uate "log=utile_log" when the "-V" command-line argument is provided (or assign ARG_ARCH to the user-supplied value after "-a"). Using the $log variable invokes the function, such as in:
preprocess() {
    $log "Preprocess"
}
If "-V" isn't supplied, then every invocation of $log simply returns without printing a message. The upshot is a reduction in conditional statements. Function pointers FTW. I wrote about this technique in my Typesetting Markdown series:
I'm quite happy to see that something Bash-related is on Hacker News! Unfortunately it seems that I don't really agree with much of what the author says...
While I do agree that it would be nice to be able to have 'local' functions and have inter-function cleanup work better, the logical conclusion for me was not to use function subshells. Since the use case is for larger programs (where different functions may want to have their own cleanup mechanisms), I'm opting to go for more of a library route. For example, I'm working on a Bash library that includes a function to allow different sources to add (and remove) functions to the same `TRAP`. A similar function may be useful, possibly involving the `RETURN` trap and the `-T` flag for the use case the author brings up. Obviously, using a package manager for _Bash_ of all languages brings in a lot of overhead, but I think it can be quite powerful, especially with a potential "Bundle" feature that makes scripts work without the package manager.
Concerning specifically the use of subshells: as other commenters have pointed out, it significantly reduces performance. I also disagree that dynamic scoping is necessarily bad for Bash. I find it quite useful when I need to use various common functions to manipulate a variable - since modifying and 'returning' variables from a function is usually either slow or verbose with Bash. Admittedly though, this feature is quite annoying at times - for example, most public functions in my Bash package manager[2] have their variables prefixed with two underscores - because they `source` all the shell scripts of all package dependencies - so I want to be extra certain nothing weird happens.
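The TRAP idea I mentioned above is roughly this shape (a sketch, not the actual library code):
    declare -a _exit_handlers=()

    add_exit_handler() {
        _exit_handlers+=("$1")
    }

    _run_exit_handlers() {
        local handler
        for handler in "${_exit_handlers[@]}"; do
            "$handler"
        done
    }

    trap _run_exit_handlers EXIT

    # two independent pieces of cleanup, registered from different files
    cleanup_tmpdir()       { echo "removing temp dir"; }
    stop_background_jobs() { echo "stopping background jobs"; }
    add_exit_handler cleanup_tmpdir
    add_exit_handler stop_background_jobs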
Bash is completely overlooked in technical interviews, at least the ones I've been involved with. But once in the role you'd find bash scripts keeping the lights on behind the scenes.
At one time, I did teach myself to write shell scripts. I even wrote this 3K-line monstrosity [0]
However, I would strongly advise mastering a proper programming language. I respect the article and the efforts of the author, but I feel that it is the past.
I mastered Python a bit, and the ability to just use things like dictionaries, proper parsing libraries and such, instead of kilometers of fragile pipes, is so much better.
I understand something like Python may feel like total overkill, but that 10 line shell script suddenly needs quite a bit of error handling and some other features, and before you know it, you wish you had started out with Python or something similar.
Interesting idea. I have thought about doing the trap cleanup but found it cumbersome to reason about when there are many functions so this is helpful. I would like to have seen a complete example at the end rather than just explaining why it's cool and then leaving it to the reader to imagine what it looks like.
Besides cleanup, one thing I would love to see is a good mechanism for logging. I have started to build a file of functions, and then other files source that as a library and call the functions as needed. I would love to be able to tell the library functions to log something if the parent file wants it, print to stderr or stdout by default, or be silent if the caller wants that instead.
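Something like this is what I have in mind (all the names here are made up, just to illustrate):
    # Caller sets MYLIB_LOG before sourcing the library: "stderr" (default),
    # "stdout", or "none".
    mylib_log() {
        case "${MYLIB_LOG:-stderr}" in
            stdout) printf '%s\n' "$*" ;;
            none)   : ;;
            *)      printf '%s\n' "$*" >&2 ;;
        esac
    }

    mylib_do_thing() {
        mylib_log "doing the thing"
        # ... actual work ...
    }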
I have used bash to write an OCR processor that called a python wrapper around tesseract, and then turned the pdf output into json to go into a solr search database by parsing the output with sed.
I recently discovered, similar to the author of the post for this thread, that local variables are dynamically scoped.
I have been writing a lot more shell scripts lately, using a "library" [1] of sorts I've been writing. When I was debugging one of my scripts that uses mycmd, I discovered that I had failed to declare some of my variables local and they were leaking out to the global scope.
I had recently added functionality to call a set of functions on script exit, so I added something that would output the defined variables, in hopes that I could write something that will output them at the beginning and then the end and show the difference. I was surprised when variables defined in my dispatch function [2] for those at exit functions were showing up, even though they were definitely defined as local. It was then that I dug around and discovered the dynamic scope of variables.
I've been trying to figure out how to accomplish what I desire but exclude those variables from calling functions. I haven't been able to find an obvious way to see if the variable is coming from a calling function. I might be able to use techniques like you've pointed out in your linked post to add the tracing that I want. Still need to think more on this.
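For anyone who hasn't run into it, the surprise boils down to this:
    outer() {
        local x="outer's local"
        inner
    }

    inner() {
        # x is not declared here, but dynamic scoping means inner sees
        # whatever its caller declared local
        echo "$x"
    }

    outer   # prints: outer's local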
Yep, do this OP... don't try to hack in some automated script that now has a race condition with device setup. udev has a ton of hooks for enabling, disabling and doing anything when device state changes.
Look into NOPASSWD in the sudoers manpage. You can just put the code in a script then give %wheel (or whomever) NOPASSWD access to run it. This can also be thrown in sudoers.d for ease of copying and managing config across machines.
We used to have very long shell scripts and recently I refactored most of them to use functions and shellcheck in our presubmit. This has greatly helped with catching bugs and improving readability.
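The shellcheck part of a presubmit can be as small as something like:
    # fail the check if any tracked shell script has shellcheck findings
    git ls-files -z -- '*.sh' | xargs -0 shellcheck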
Just don’t forget to install the correct Ruby version, of course keeping the system’s Ruby intact, so better use RVM for that, and of course you shouldn’t install random gems globally, so better do the whole thing via Bundler, and of course don’t forget to check in the Gemfile.lock too.
Actually, you can expect a recent version of Python 3 from most distros these days. The stdlib is quite powerful, and no harm would come from a small number of globally installed libs with stable APIs, like click and requests.
The key word there is "these days". There have been a lot of computers built and software set up in the last ~20 years that are still getting used today and will be for a long time. Not every environment is using $latest.
Despite this you can expect your bash scripts written today to work on any of these machines and their ancient OS installs. This is primarily because people writing in Bash care enough to not use new features, not the lack of new features in Bash.
Python 3 (3.6) is available in the base repos for Ubuntu 18.04 and CentOS 7. Ubuntu 16.04 was EOLed in 2021-04 and CentOS 6 is EOL since 2020-12. Using anything older is plain insecure (unless you are paying for ESR contracts beyond the EOL date of the LTS). I am definitely not talking about "latest".
Canonical offers free Ubuntu Advantage for 3 machines which gives you extended service maintenance for even older distros like 14.04 (which is what I am using to type this). Ubuntu is a great distro because it supports each LTS for a full 10 years. Constantly upgrading and breaking things and having to switch software and workflows is way more insecure imo.
Additionally, there are tons of devices out there that will never get updated that run even older unsupported linux distros and do their job just fine. And modern written bash will run just fine there too.
> Just don’t forget to install the correct Ruby version
This justification for bash has always been perplexing to me. If I’m operating in an environment where my ONLY reliable infra invariant is “bash will probably work”, cleaning up the org’s infrastructure clusterfuck is probably my #1 priority.
(Or I guess in a few cases the fleet are not boxen but little embedded/iot devices, in which case you probably don't want to be running any of these sorts of scripts for a whole host of reasons...)
> of course you shouldn’t install random gems globally…
Or you can just use bash and keep your sanity.
What are some scenarios where you’d need a gem in Ruby but Bash just works?
The only situations I can think of are cases where you’re using Bash to call out to some executable that does fancy things. Which (1) you can do from any scripting language anyways, and (2) means you’re just shifting dependency management from the package manager to the image/Dockerfile.
But actually, the combination of justifications here is particularly perplexing. If the org has a stable way of handling system images, then you'll know which version of $scripting_language should exist and where it's installed. The only way you end up with language version woes is if you don't have standardized infra. BUT... if you don't have standardized infra, and Bash can do things that Ruby can't without special Gems, then it stands to reason that you're in a situation where your Bash scripts depend on the magic state of individual unicorn boxes?! Which is particularly fragile and frightening and far worse than installing some local gems or whatever!
IDK. The purported benefits of Bash always sound like they flow out of environments where there are basically no fleet-wide infrastructure invariants.
"Just use $scripting_language" might be the best advice in this thread just as a sort of canary in the coalmine. I.e., if your org can't "Just use $scripting_language" because "which version?" then the team will probably benefit tremendously in an infinite variety of ways from an afternoon of infrastructure cleanup. Regardless of whether they use bash or a scripting language going forward :)
The advantages of bash are almost all related to it NOT being a "real" programming language. The terseness, ease of writing self-modifying code and anonymous functions, lack of typing, flexible syntax, easy interoperability with any other language and program through any available interface, are not really desirable for writing stable and maintainable code. They are hugely desirable for quickly hacking something together, testing and learning, and the numerous simple scripting tasks involved in system administration.
> They are hugely desirable for quickly hacking something together
Undeniably, yes. I was there in the 90s ;-)
But hacking things together in a way that's robust is difficult, and bash isn't a good match for that difficulty.
These days I mostly operate in the realm of "how can I enable others to hack things together without blowing tens of millions of our dollars and their very early career on a stupid mistake".
But it is so full of landmines that even those quick dirty hacks will fail.
Also, don’t even get me started on self-modifying code. We have one at a work project and it sometimes just fails and results in inserting the same echo statement at each run, so every bootstrap displays 2^n messages, depending on when I last cleaned it up…
> This justification for bash has always been perplexing to me. If I’m operating in an environment where my ONLY reliable infra invariant is “bash will probably work”, cleaning up the org’s infrastructure clusterfuck is probably my #1 priority.
That is not your job. Your job is to get that machine working using whatever is already installed. Adding a new package means going to the production committee with your proposal and justification and analysis of the increased threat surface.
>> This justification for bash has always been perplexing to me. If I’m operating in an environment where my ONLY reliable infra invariant is “bash will probably work”, cleaning up the org’s infrastructure clusterfuck is probably my #1 priority.
> That is not your job.
This sort of thing is definitely your job if you have the word "Principal" in your job title, and probably also if the word "Senior" is in there as well ;-)
And in any case everyone is responsible for excellence in the milieu in which their team operates. If a Senior or even a fresh-grad Jr. comes to me with a solid idea I'll champion it as if it were my own baby. And then recommend/fight for rapid promotion in the case of the Jr., or put in a good word for promotions for the Sr.
If you recommend your org have standardized images with well-documented info about language versions etc. and the answer you get from your management/tech leadership is "not your job", I recommend finding a new job.
> Adding a new package means going to the production committee with your proposal and justification and analysis of the increased threat surface.
The context of my quote was "knowing which version of ruby/perl/python is installed". There's almost certainly a version of one of those on your standard linux machine, and everyone pushing to prod should damn well be able to look up exactly which one.
> Adding a new package
The general debate here goes way beyond adding a new package. Good infra needs WAY better invariants than "definitely bash is installed in the usual place". If a concern is "IDK which version of Ruby is installed on the machines I'm targeting" then either you're fighting fires and need to keep every intervention really damn simple, or else your org has Real Issues. In either case, bash is the enemy.
> The defaults are everything.
Those defaults aren't handed down from Gods. Your org chooses them.
Installing ruby/predicting the version of ruby installed/writing ruby that can run on any version installed... is unfortunately non-trivial.
I think pretty much the only reason people write bash is because you have an incredibly high chance of bash being installed and a version of bash being installed that will run whatever bash you write just fine.
Perl is honestly almost as reliable to be there predictably and compatibly... but I guess people would rather write bash than Perl?
Is there any risk of Bash ever going away? It seems like it's the de facto shell. I remember considering whether or not I should learn Perl at one point. It didn't even feel like a choice with Bash. Trendy shells seem like they have no choice but to support it too.
I feel like what usually makes me reach for something beyond Bash is really a matter of wanting or needing some dependency that wasn't written in it for whatever reason. Usually this happens right at the point where the script/utility starts to turn into a library/program, so it's trivial to just transpose the control flow into whatever language is required at that point and go from there. This of course raises the type of concern you mentioned about Ruby, but at that point it's hopefully worth the trouble to address.
Bash might be consistent, but what about the programs you're calling out to? Even basic utilities have different options between BSD and GNU Coreutils. Something like git might not have options you're expecting due to differences between versions. Or if you need to download a file using HTTP, you will run into a problem when you run on a machine that has wget when your script was expecting cURL.
And yes, you have these sorts of problems with other languages, but my point here is that Bash doesn't free you from them.
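In practice you end up writing small shims for this anyway, e.g. something like:
    fetch() {
        # prefer curl, fall back to wget; these particular flags are widely supported
        if command -v curl >/dev/null 2>&1; then
            curl -fsSL "$1"
        elif command -v wget >/dev/null 2>&1; then
            wget -qO- "$1"
        else
            echo "need curl or wget" >&2
            return 1
        fi
    }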