My experience doing integration testing with BATS was that it was complete misery (not the BATS project’s fault, I think it’s just inherent in the nature of the thing).
You REALLY want a real programming language for any kind of non trivial integration testing. Things like set up and tear down, checking outputs, and testing many-similar-but-not-identical cases gets hairy fast. Sure you CAN do it in bash but just because you can doesn't mean you should. When we rewrote our BATS tests in Go was a huge step forward for the project.
One time I had to write a Json marshaller in bash (don't ask), and bats was invaluable. BATS is great when you need to unit test bash. And for the most basic integration tests on something that is meant to be a CLI.
Outside of the scope of elaborate CI pipelines, I wonder how useful this can really be.
Big CI pipelines are one of the few instances where I can think of Bash being both an appropriate choice AND the resulting product being large, elaborate, and sensitive to failure - which would benefit from being tested. Most other applications of Bash are generally just so simple that fundamentally altering how you write scripts ("Bash scripts must also be broken down into multiple functions, which the main part of the script should call when the script is executed.") for the sake of testing them seems like it could easily fall into the category of over engineering.
Beyond the CI pipeline use case, wouldn't the tools in which this would actually be properly useful be better off written in a proper programming language?
Whenever someone says "most people don't write large Bash scripts", I have to chime in and say that I would consider most GNU/Linux distros to be giant piles of shell scripts.
A year or two ago, Arch Linux migrated the tests for dbscripts (the server-side of how package releases happen) from shUnit2 to BATS. https://git.archlinux.org/dbscripts.git/
I would consider most GNU/Linux distros to be giant piles of shell scripts.
Yes! That was one of the primary motivations for my Oil project [1]. I was building containers from scratch with shell scripts (in 2012 or so, pre-Docker), and I was horrified when I discovered how Debian actually works.
And of course it's not just Debian. Red Hat, Fedora, Alpine, etc. are all big packages of shell scripts, often mixed with other ad hoc macro processing or Makefiles. Alpine does this funny hack where their metadata is in APKBUILD shell scripts, which is limiting when you want to read metadata without executing shell.
I also point out in that post that Kubernetes is pretty new (2014) and it has 48,000 lines of shell in its repo.
That's true of most cloud infrastructure. If you deal with Heroku, OpenStack, Cloud Foundry, etc. there is a ton of shell all over the place. With buildpacks, Travis CI, etc.
And that's obviously not because the authors of those projects don't know what they're doing. Shell is still the best tool for that job (bringing up Unix systems), despite all its flaws.
The world now runs on clusters of Unix machines, and in turn big piles of shell scripts :)
> I don't know how bats works with the @test annotation? That doesn't look like valid shell syntax.
Bats runs the test files through a preprocessor[1] that, among other things, converts each `@test` case into a shell function. E.g. `@test "the doohickey should frob" { ... }` becomes `test_the_doohickey_should_frob() { ... }`.
Also, I looked at the buildpacks-ci repo you linked, and it still has ~2000 lines of shell in ~80 files.
For comparison, there's ~12,500 lines of Ruby.
Another issue I have is that if Kubernetes replaces its 44,000 lines of shell with 100,000 lines of Go (or probably more), that's a mixed result at best. There's just a lot of logic to express and shell does it concisely.
Of course, without a better shell, I don't blame them if they switch, but it's still a suboptimal state of affairs.
I agree bash is not a sane production language, and apparently so does Greg Wooledge, somebody who has maintained a popular bash FAQ for a very long time. I quoted him in my latest release announcment:
I did hear about Cloud Foundry moving from shell to Go for some things, and I also heard that Kubernetes was rewriting a lot of its shell. [1]
(1) If you had to choose between Go and bash, I can understand choosing Go. I saw some blog posts along those lines [2].
Although I don't think it's optimal. I'd be interested in seeing some of those "shell scripts in Go" if you have a link. There should be a better language for sane shell scripting (hence Oil).
(2) I imagine it's not fun to port thousands of lines of shell to Go (or Ruby) by hand. Oil is supposed to help with that via an approximate translation and good errors, but that part isn't done yet.
What IS close to done is a dialect of bash that is sane -- or CAN BE MADE sane with user feedback.
For example, in the latest release I added set -o strict-argv, and there's also set -o strict-control-flow and strict-word-eval. I will add a strict-ALL to opt into all at once, as mentioned in the release notes.
Also, OSH gives you all your parse errors at once, rather than having them blow up at runtime later:
[1] I just pulled the Kubernetes and there's 44K lines in *.sh files, as opposed to 48K lines a couple years ago. If it were proportional to the project's growth, I would have expected it to be 100K lines by now, so it seems they are indeed getting rid of shell!
However this only removes ONE LAYER of shell. Any cloud service that uses a Linux distro (which is all of them) is papering over all the nasty layers of shell underneath! I hope that Linux distros will gradually move to Oil to get rid of this legacy.
Bash is proper programming language, though not necessarily modern nor ergonomic.
The problem is:
1. Bash is assumed to be everywhere, so people use it for maximum portability or bootstrapping.
2. It started as a 10-100 line "quick" script, but then grew into a monster and nobody wanted to take the hit to rewrite it in a modern programming language.
I've found when you enforce the same software best practice requirements regardless of language, people start choosing not-Bash since "they have to do it right anyway". Many devs see Bash as a shortcut to avoiding the extra work.
Tangential question, but are there any efforts to treat shell scripts as a compilation target? It seems like such an insane language, I don't know why another nicer language hasn't emerged on top of it.
I can understand that because I thought that way too before I really learned it.
Bash is a language like any other: it takes reading and practice to get good at it. Also most of the WTFs in Bash actually make decent sense when you consider that whitespace is the delimiter in bash (for example setting a variable is VAR='value' instead of VAR = 'value'. That trips newer people up pretty badly at first).
Most people also abandon good software practices when they write bash for some reason. For example, you'd never write 100 lines of ruby or javascript without declaring functions, yet people do that all the time in bash (they shouldn't). If people followed good practice, I think bash would seem a lot less insane.
I think lack of proper arrays and structured data types is a big issue with bash. Basically everything is a string, with all the issues around escaping and corner cases that that implies
Yeah very fair. I actually thought I included a line like this: "If you need arrays, hashes, or floating point math then bash is a poor choice" but I must have deleted it (I've been trying to use more brevity when I write).
But the main point of the shell and of scripting the shell is to execute programs and creating pipelines.
If you need to do complex things, or you need special data structures or you need to process the output directly yourself in any significant way, then shell scripting is simply not the right tool for the job.
And in the case where you want to use another language why do you want to transpile to shell script? Better to just install Python 3 or something on the server. (It might even be installed there by default already.)
Yes! So much yes. Many developers write crappy Bash scripts because they don't see it as a real language, but it's an interpreted language just like Ruby, Python, Perl, etc.
If you're going to design modular libraries and scripts with best practices, unit tests, etc, for those languages, then you should be doing the same with Bash.
If you're writing production scripts using Bash, you should be following software best practices - don't give me any "it's just Bash" excuses.
Mocking In addition to bats-mock (part of the bats library in the article), there is another mocking approach: https://pbrisbin.com/posts/mocking_bash/ They both have their uses.
Argument Parsing Use Docopt ( http://docopt.org/ ) to get modern arg-parsing in your Bash scripts. Don't waste your time trying to roll your own or use the built-in anemic arg-parsing. Shells implementation: https://github.com/docopt/docopts
BUT why are you using Bash? Go use a modern language and don't suffer the idiosyncrasies of Bash where you have to re-invent many common libraries already available like JSON parsers, etc.
My favorite Bash replacement is Python + Plumbum: https://plumbum.readthedocs.io/en/latest/ All the convenience of Bash when invoking commands (no more subprocess()!) with the joy of Python.
Shellcheck has saved me countless times, but I noticed it's hard to get people to adopt it into their workflow.
I've introduced probably a dozen people to shellcheck and warn them ahead of time that their scripts are broken. They try it, see a flood of errors and warnings, close the window, and never use it again.
The logic surrounding their undefined variables and improper conditions happen to work as hoped, by luck, combined with their strict adherence to "it's best practice to not use spaces" both keep their scripts crawling along "just fine".
> If you're going to design modular libraries and scripts with best practices, unit tests, etc, for those languages, then you should be doing the same with Bash.
You win for enthusiasm when it comes to that syntax. I appreciate I pipe output to uniq (MacOS) to get uniques, proving that easy is an extremely relative value :)
You REALLY want a real programming language for any kind of non trivial integration testing. Things like set up and tear down, checking outputs, and testing many-similar-but-not-identical cases gets hairy fast. Sure you CAN do it in bash but just because you can doesn't mean you should. When we rewrote our BATS tests in Go was a huge step forward for the project.