Write plain text files

imgabe · on March 2, 2022

Another advantage to plain text files: source control. You can check your writing into git and get a history of all your edits.

It’s something programmers take for granted, but it would be amazing if this got more widely adopted outside of tech. The number of files with names like “Report Final Final draft v3.docx” is truly staggering.

“Git for everything“ would be a multi-billion dollar startup easily.

phreack · on March 2, 2022

I once thought git for humans would be a great idea but never got around to speccing it out. Later on, a lawyer friend showed me the software they used 'for backup' (that they paid thousands per month for) and it turned out everything about it was just exactly like SVN. The terminology was different, the UX was laser focused to the intended users, but at the end of the day it was commits, syncs, merging, pretty much everything but branches, just laid out with domain specific language instead, professional office UI and simple UX.

So, uh, yeah I agree absolutely with you.

awr · on March 3, 2022

What was the tool called? We're currently looking to solve a problem for a client to better manage contract negotiation between his lawyers and the purchaser's lawyer (apartment units). ATM it's all done via email but I imagine there'd be heaps of tools out there.

ben174 · on March 2, 2022

I tried to git my resume once upon a time. It's still something I'd love to finish doing, it just makes sense to have a git repository as a timeline of your life:

https://github.com/ben174/bugben

robin_reala · on March 2, 2022

My official CV is a page on my site, that I print to PDF if someone asks for it. The page itself is as source-controlled as the rest of the site.

thefunnyman · on March 3, 2022

I do the same by having my resume in latex which I’ve checked in to git. Been meaning to write a GH action to generate the pdf artifact automatically.

tomrod · on March 4, 2022

I've decided to migrate my CV to Markdown instead -- pandoc can dump to LaTeX, PDF, you name it.

chrisfinazzo · on March 4, 2022

I eventually settled on LaTeX - although I considered dropping all the way down to groff/troff and having TeX be an intermediate step in the process.

Custom LaTeX classes made this more trouble than I was willing to deal with, so I decided to not pursue it further. I suspect Markdown might have similar challenges dealing with this, although given that the "verbose mode" is just HTML, I might be able to make it work.[1]

The print media type has been around for eons, but the @page rules don't support everything I need and are generally absent in WebKit browsers.

[1]: https://github.com/chrisfinazzo/resume

vertis · on March 2, 2022

Enterprise Content Management like EMC Documentum has this sort of thing baked in (it's been a long time, but they're definitely on top of it). But that's for big companies.

Zak · on March 2, 2022

I had a similar thought some time ago and concluded it's an impossible task. The problem is that it would need to understand every file format its users care about and be able to represent changes in a useful way.

How do you merge destructive edits to image files?

yencabulator · on March 4, 2022

Many version control systems aimed more toward digital assets do some sort of locking, so only one user edits a file at a time. Advisory or mandatory, with support for breaking locks, etc.

tracker1 · on March 3, 2022

git-lfs with appropriate file types set to use it.

sslayer · on March 2, 2022

LOL,sweet summer child, its all 0's and 1's

munk-a · on March 2, 2022

Please explain, in 0's and 1's how to combine the change to add a dark splotch over here and a patch of blue over there so that the image you get out of the merge has both the dark splotch and the patch of blue... without teaching git how to comprehend image file formats (and assuming you're not using a bitmap which is sort of a free win).

D13Fd · on March 2, 2022

Honestly I think modern Sharepoint/OneDrive based .docx files have already leapfrogged git for the average user.

In an office environment, modern MS Word gives you a really nice version history, with automatic or manual/named versions, plus multi-user simultaneous editing, with a slick and fast UI. It’s even cross platform - I send a coworker a link to a document, and they can open it and work on Word for Windows while I edit the same doc on Word for Mac or even on the web-based Word.

It’s shockingly good.

CorrectHorseBat · on March 2, 2022

After not using Word for years we've been using it for a few months now for a project, expecting it would be decent.

While some of the ideas are great, the overall experience was pretty bad. Leaving autosave on often slowed down Word to an unworkable pace and after a few other issues (I'm not exactly sure what, wasn't involved) we just made sure only one person at a time works on the document.

Some other annoying small issues I experienced: when working together sometimes your undo history is gone. I guess that's because someone else edited too and correctly undoing something someone else edited in between is not simple. IMO it's impossible and any system doing merges automatically will fall apart at some point.

Track edits cannot track deleting rows of a table

Wed based word works actually pretty fine, but it misses random features. Adding captions seems not to be supported.

yarky · on March 2, 2022

I agree, I also expected the whole office365 ecosystem to do what's supposed to, but I also found it useless due to the slowness and errors the whole team kept getting while working on the same file. That and the fact that it somehow feels like the Mac version of their products always comes with a bonus bug. Don't event get me started on their web versions.

It's the little things...

D13Fd · on March 2, 2022

Were you using it on Linux or something? Or maybe with very large documents?

I use it every day and really haven’t had slowness issues, even when running via Parallels on the Mac side. Of course, typically my docs are <100 pages and light on images.

I have seen undo history go away when edits collide with multiple users, but that’s really the only way it could work. And you can still view old versions of the doc if you need to retrieve/restore something.

MS Word as a whole is definitely very far from flawless, but it’s still better than the alternatives I’ve seen, and it’s extremely entrenched. That’s why it’s hard to see “git for office docs”succeeding.

CorrectHorseBat · on March 2, 2022

>Were you using it on Linux or something? Or maybe with very large documents

~250 pages, ~100 images, not something I expect a modern computer to struggle with. We used the web version on Linux sometimes, but I think it also happened when we didn't.

>I have seen undo history go away when edits collide with multiple users, but that’s really the only way it could work. And you can still view old versions of the doc if you need to retrieve/restore something.

The other way is like git does it, manual merging and conflict resolving. I don't think there's a way around that once you work with enough people together on a file.

D13Fd · on March 2, 2022

That makes sense, although obviously the downside to a manual check-in/merge process is that it greatly slows down the editing flow.

Personally for the kinds of documents I work on, I'd rather risk losing my undo history than have to deal with manually merging changes from multiple users.

Plus, without real-time concurrent editing, the changes are much more likely to conflict.

srazzaque · on March 2, 2022

It's great, until it isnt. For small and simple documents (sub 20 pages), there's no better option.

In my experience it starts to creak on large documents with multiple editors. I did a project many years ago where the key deliverable was a large report, worked upon by 3-4 contributors over 12 weeks.

That last week was absolute madness. Document got corrupted, and we tried all sorts of things. We ended up importing the corrupt document into Google Docs, which, oddly enough, did a better job of working with a corrupt Word document than MS Word did.

jmmcd · on March 2, 2022

I've seen online Word become extremely slow even for small documents with a few editors.

It doesn't help that there are lots of versions of online Word - there's the one that lives in Sharepoint, the one I get to via Outlook, the one in Teams, and (maybe? somehow?) the desktop Word when editing an online file.

jsmith99 · on March 2, 2022

Word also includes a nice GUI diffing tool for docx files. It's possible to make git diff use it using something like https://github.com/ForNeVeR/ExtDiff

thrwawy283 · on March 2, 2022

Thank you for sharing this. I had no idea it existed! I was getting pandoc cleared at my work, but for the Windows environment that it is, this is another solution :)

bodge5000 · on March 2, 2022

I dont use word, but I do know people that do use it (the modern version) and they still have files called "...final 3 copy(4) really final.docx".

These features are nice and all, but the thing is, most users not only don't care about using them, but don't care about even the idea of them to know they exist, so they continue with the old workflow.

Most of the time its better to get out of the users way, but in cases like this, if you truly do want to get rid of the infinite finals, they need a bit of a nudge

wildzzz · on March 2, 2022

We use SharePoint for revisions for casual documents and another CM system for tracking releases of official documents. The CM system is meant really for CAD models but it lets you upload Word docs that it displays as PDFs with watermarked revision information so you have to have a revision table at the front of the document for listing changes. The SharePoint system is a lot easier to use but you don't get the same sort of change approval system since anyone can edit. Instead we use multiple folders for WIP, waiting for QA approval, and final versions. We have to follow ISO 9001 but anything developmental or that is just derivate of documents in the CM system can live on much more casual systems like git, SVN, or SharePoint.

xerox13ster · on March 2, 2022

That's not plain text and not easily parse-able with mutli document search

zouhair · on March 2, 2022

It's a proprietary format, it's not gonna last.

Microsoft word can't even open files from old version of the same software.

None of these word document will be readable in 50 years from now, a text file though.

catach · on March 3, 2022

A modern MS Word document is pretty much zipped XML and mostly ISO standard-compliant.

thrwawy283 · on March 2, 2022

Hmm. Sometimes I step back and I look at the version history in modern Office documents and it does feel convenient. I like the CRDT editing a lot. It still feels too loose and prone to issues, but I recognize its value to the average person.

Most of what bothers me is trying to view, read-only a previous version. Usually I have several Word or Excel documents up on my screen, and it's easy to begin modifying a past version that then becomes the latest. It's not reflex to open a previous version and untoggle autosave. It bites me all the time.. In Office's pursuit to help people avoid losing data, I wind up overwriting what I need to preserve. A lot..

I have not seen manual version names in Word doc history. Maybe I'm sheltered :s

GrumpySloth · on March 2, 2022

The idea is pleasing, but when you specifically look at Git, its UX is rather disappointing. I'm not even talking about the CLI (after a while you can get used to it). I'm talking about resolving conflicts. If I have a sequence of changes A -> B -> C, then I change A and rebase B on top of A resolving any conflicts, then rebasing C on top of B will most of the time mean that Git will most likely ask me to resolve the same conflicts all over again and more. The kind of conflicts is also staggering. Each time I look at the conflicts it generated, I cannot explain how it could come up with that craziness. And all the while it also merged some changes without signalling conflicts, but instead breaking something in significant ways, without notifying me in any way, i.e. silently breaking something potentially important.

From what I've read, Pijul should fix at least the problem of needing to resolve the same conflicts multiple times over and over again. However, I feel that version control focused on text breaks, because the merging algorithm doesn't know anything about the semantics of what's represented in text. So yes, I'd say that version control in more areas would be nice, but one based on binary formats understood by version control. One glimpse of that may be KeePassXC, which can merge password databases and has never done it wrong in my experience.

Hackbraten · on March 2, 2022

While I agree in that Git’s UX is abysmal for the beginner or casual user, no one should have to endure resolving the same conflicts over and over again.

In my opinion, the following configuration should have been the default with Git:

    git config --global rerere.enabled true
    git config --global rerere.autoUpdate true

The `enabled` part means: transparently record all resolutions in a database, and re-apply them whenever bumping into the same conflict with the same pair of files in the future.

The `autoUpdate` part means: every time you finish re-applying a recorded resolution, please `git add` the result automatically for me so I don’t have to look for a "conflict" that’s actually no longer there.

FreezerburnV · on March 2, 2022

I had no idea these options were a thing. Thank you for pointing them out, adding them to my config immediately and hoping it makes some merges and whatnot easier going forward.

pmeunier · on March 2, 2022

Hint: rerere doesn't always work. It's still guesswork, like the rest of merges/rebases/cherry-picking in Git. This is because looking only at the tips of branches doesn't work, and that's the only thing Git can do (with or without diff3 as the merge algorithm).

Hackbraten · on March 4, 2022

The rerere database is content-addressed, and rerere will apply a resolution only if the pair of conflicting files is a bit-exact déjà-vu of a conflict it has seen before.

I don’t see how that is guesswork. My impression is that rerere is perfectly deterministic, and only repeat things the user has done before.

The downside is: once the user has manually created a faulty resolution, then rerere will possibly replay that faulty resolution.

That’s how things can go wrong, and I see how one might blame that failure on rerere itself. And I think the blame not entirely unjustified, because rerere could do a much better job in explaining how it interacts with certain scenarios. For example, let’s say I’m aborting a rebase. Will rerere roll back the resolutions it recorded? I honestly don’t know. The manpage vaguely claims that it will, but I’ve been unable to reproduce it.

I wish rerere made more transparent how it interacts with `--abort`. Just printing an informational line would already go a long way. I also feel a command like `git rerere log`, which would print the recent activity of the `rerere` database, might help with that.

greyhair · on March 2, 2022

The git diff process and conflict process really goes off the rails at times, and I have never figured that out. Modify one line in a file, and add one new line two lines after that modification, and the git diff is two 40 line chunks with some common lines, a bunch of additions, and a bunch of deletions.

Really? One new added line, and one delete/add two lines above it.

it usually works fine, it really does, but when it messes up, it really messes up big time. As long as it doesn't create a conflict, you just shrug and move on, but when it does create a conflict, holy-moley!

Nullabillity · on March 2, 2022

The biggest UX sin of Git is that rebasing is featured so prominently. If you had just done a normal merge commit then Git could have realized the common history, done a three-way merge, and resolved the conflict automatically.

But no, people always seem to insist on that linear history is the only concern that matters, explicitly delete the history, and then wonder why Git is so annoying and easy to screw up.

Maybe Git should have locked rebase behind a feature flag of some kind. `git pull --rebase` should certainly never have been added.

GrumpySloth · on March 2, 2022

When I have a chain of commits pending review on Gerrit, and I fixed some flaws in the first commit in chain and need to rebase the dependent commits, merging is not an option. And rightly so: when I look at the master branch, I don't want to see random corrections someone made during review, they're just irrelevant once the reviewed changes are merged.

There are many reasons why linear history is important. Rather than saying that "you shouldn't want to do that", I'd prefer it if the tools people use were fixed to better serve the things people actually want to do.

A merge would only make sure that I don't need to resolve the same conflict multiple times. It wouldn't result in the conflicts not being generated in the first place, or them being saner. The only difference in conflict resolution between rebasing and merging is that the sides are flipped (what's shown on left in rebase, is on the right in merge, and vice-versa). Which doesn't address the second issue I listed: conflicts are not only repeated, but the conflicts themselves are pretty crazy. In my example, often-times C would never even touch files which were detected as having conflicting changes with A or B. It was pretty absurd that I would have a 3-line change in C, several hundred-line changes in A and B, yet the biggest conflicts would be triggered when rebasing C on top of B, and those would be in files C did not touch. Other times diffs in conflicts would have most of the lines added and removed actually identical, with only a couple in the middle different. Why would git mark them as conflicting, is beyond me. And then there was the issue of git not detecting conflicts, when it should have, instead merging changes in a way that broke source code. None of those issues is better handled by merging than by rebasing. Most of them however have pretty good solutions in a patch-based version control system, as opposed to snapshot-based like git (conflicts in files which weren't changed? not going to happen). An even better solution would be binary formats with domain-specific merge logic, like the one in KeePassXC.

Nullabillity · on March 2, 2022

> When I have a chain of commits pending review on Gerrit, and I fixed some flaws in the first commit in chain and need to rebase the dependent commits, merging is not an option.

No, you just add it at the end of the commit chain?

> And rightly so: when I look at the master branch, I don't want to see random corrections someone made during review, they're just irrelevant once the reviewed changes are merged.

git log actually has the --first-parent for this, which hides all of the commits that were merged into the branch, without destroying the history when you try to go back and try to understand why the choices were made. The idealized version of history created by constant rebasing serves neither purpose.

> Rather than saying that "you shouldn't want to do that", I'd prefer it if the tools people use were fixed to better serve the things people actually want to do.

Agreed, tools that cope poorly with merges should be fixed, rather than forcing people to hack around it by rebasing.

> None of those issues is better handled by merging than by rebasing.

Not quite, rebasing generates more false positives, since it tries to merge every intermediate commit instead of only looking at the end states and the common ancestor.

pmeunier · on March 2, 2022

> However, I feel that version control focused on text breaks,

Pijul doesn't, by the way, the diff algorithm is customisable. I wrote one industrial application of Pijul that uses spaces as breaks, and semantics-aware breaking is totally doable.

vlovich123 · on March 2, 2022

Have you tried git rere?

GrumpySloth · on March 2, 2022

No. That's the first I'm hearing of it. Thanks. I'll try it in the future.

bob1029 · on March 2, 2022

> “Git for everything“ would be a multi-billion dollar startup easily.

In addition to plaintext documents, I have found that a simple JSON diff is also a very effective way to demonstrate changes between 2 complex biz objects. Non-developers can cope with this as long as the differences are visually obvious (red=removed, green=new, etc) and the object graph is reasonably flat. Everything can be trivially serialized to a JSON document, so this scales super well in my experience. We use a port of Google's DiffMatchPatch to generate human-friendly HTML reports of object diffs in our latest administration tools.

jollybean · on March 2, 2022

Git is barely useable by technical people.

Git would be a byzantine disaster for 'everyone else'.

The graph and abstractions involved introduce enormous unnecessary complexity.

Now, 'historical changes for everyone' - yes.

But not Git. Git is very powerful, but ultimately a questionably valuable product in many cases overall. And we've now settled on it, so it's a bit difficult to displace.

yarky · on March 2, 2022

I was actually surprised : last time I used Git Bash on Windows it did work quite well with .docx files.

Then I switched jobs to avoid touching .docx files ...

The best alternative I've found to generate .docx output is R markdown, which uses pandoc under the hood and let's you program the whole document the way LaTeX would.

greyhair · on March 2, 2022

Pre LaTeX (I am old...) I wrote a huge command definition document for an embedded telecom product in nroff/troff. The whole thing was text files, with all the common parameters documented once, and the whole thing was assembled using a Makefile. So a typical page would be the command name, a descriptive paragraph, an include list of the parameters it used, and an include list of the possible return errors. Very little writing for a new command that mostly used existing parameters and errors.

And all tracked under CVS with a management/tracking layer on top.

# make command_doc

and the pile of text became a lovely 250 page postcript ready for the laser printer.

TuringTest · on March 2, 2022

> The best alternative I've found to generate .docx output is R markdown

I do the same with markdown in Zettlr, which has a nice inline-preview format and can be used by my non-tech partners.

The generated .docx has a limited range of styles, but we see that as an advantage.

newbamboo · on March 2, 2022

Powershell can generate and manipulate word docs, as another alternative to keeping content in a markdown file.

IceWreck · on March 2, 2022

Isnt docx binary ? So git is storing a new version of the file every commit. That way too much storage space youre wasting.

testermelon · on March 2, 2022

I've heard somewhere that docx is actually gzipped xml. But I never really confirmed that myself. But it's binary once gzipped, so your point still stands.

simion314 · on March 2, 2022

>I've heard somewhere that docx is actually gzipped xml. But I never really confirmed that myself.

docx and epub are zip files, you can rename them with a .zip at the end and open to see what is inside. It might not be as simple to zip them back, at least for epub is very important to zip the files in a certain order but I forgot the details, but is easy to do from command line.

bellweather49 · on March 2, 2022

Or use Vim; it has built in support for zip files, so you can just type something like `vim my_file.docx` and it will open the files in netrw (the built in file explorer). Move to the file you want and hit enter. "word/document.xml" has the main document contents in it.

The xml will probably need to be run through a formatter to be readable. You can type `:%!xmllint --format -` if you have xmllint installed.

Now prepare to spend several hours trying to make sense of the xml. :-p

simion314 · on March 2, 2022

Vim is not for me, I unzip the epub/docx and then open the xml/html files in Kate, I had to do this to examine what is saved or how it saved or if my custom epub exported worked correctly. For epub I think you need to make sure the metadata file is the first one in the archive for it to work(so not sure if editing stuff directly in vim will preserve the order)

JonathonW · on March 2, 2022

Docx (and pptx, and xlsx) is a zipped (not gzipped) composite of several XML files plus any other attached resources. It extracts out to a whole folder structure.

Definitely binary once compressed, though, and even when extracted not an easy format to parse. It might be XML, but it’s still representing the full complexity of an MS Office document.

zouhair · on March 2, 2022

Juzipped. Just take a docx file and append .zip to it and open it.

tashbarg · on March 2, 2022

Git stores a new file anyways. It’s only for packs, that delta compression is applied. And delta compression handles binary files.

BerislavLopac · on March 2, 2022

That's exactly what the actual git does too.

copperx · on March 2, 2022

Doesn't git do binary deltas?

thrwawy283 · on March 2, 2022

Okay. Agree entirely.

I love some of the collaborative nature of Microsoft Teams and CRDT editing Word/Excel, but I'm usually pretty remote. Text over a tenuous WAN connection is ideal.

I work at a government agency and I was /just today/ getting them to review and approve Git and VS Code for our staff use - and pandoc. A couple years ago we never would have gotten open-source software approved. I wish I knew of an equivalent to SELinux or AppArmor for Windows so they could lock down things a semi-trusted application can do. VS Code will be used in a few different departments, but I mostly want it to help those unfamiliar with Git and its CLI (Git Graph is nice).

There's a trick out there to first convert things like Word documents to Markdown, and then do a diff of that intermediate output: https://hrishioa.github.io/tracking-word-documents-with-git/

Pandoc can only do so much. I'm trying to convince my part of the gov to put policy documents we disseminate to staff in a git repo, so we can track who did what and why (based on commit messages). This will be a big step, but thankfully one part of our org is already moving toward version control for IT and data analytics so I'm proposing this and suggesting we hop on their bandwagon. Momentum.

There's a Word template that we commonly use for legal purposes where it gives you line numbers and you write text to align with the numbers. When reviewing it's easy to say "change line 23 to read ....". This - for example - will not translate well through pandoc because the numbers and the text are separate text elements.

There's a market for making pandoc better. Visually translating how elements "flow" from 1 format to another, instead of simply transpiling XML to Markdown.

I'm still excited thinking in a year I might be able to git-blame the legal department for things. Or see a diff from 1 administration to the next.

lifeisstillgood · on March 2, 2022

>>> There's a market for making pandoc better.

Some thoughts:

1. Policy makers think they write policy in English (or other human language). But more and more it is the software.

2. "policy engines" aren't going to cut it - we need to introspect code to decide what the code does - and translate that back to policy.

3. at my work the best solution i have got is using unit tests to explain what the policy is based on test comments - and again that's using english and again it's terrible ("best")

I think the real solution is both wider software literacy so that discussion happen "in code" and code that is more like policy (composable functional languages are thus likely to be useful here)

But great to hear you are taking any steps at all - would an HN letter writing campaign to your ministers help?

copperx · on March 2, 2022

I love pandoc and git and writing text files, but oh my, unless the users in your organization are tech savvy, your scenario sounds like a recipe for disaster.

thrwawy283 · on March 2, 2022

Admittedly, if there's any chance to this I think it will start with tech-savvy assistants helping to preserve documents on the backend of things. When a process takes hold budgets usually get restructured and a dedicated application is contracted around the existing process or data. I'm at minimum trying to prove the value of strict version control in my space :)

goosedragons · on March 2, 2022

At least for LaTeX there are packages like lineno that will add line numbers to the compiled document which can then be referenced with a label automatically similar to equation or figure captions. So if somebody tells you to change line 23 you can reply with "Please see line \lineref{whatever} for change." and it will automatically fill where that change is to "Please see line 25" or whatever in case it's been moved by other changes.

JohnL4 · on March 2, 2022

I have long thought writing law is a LOT like writing code. When you actually look at modern laws, they're basically legal patch files.

jandinter · on March 2, 2022

> “Git for everything“ would be a multi-billion dollar startup easily.

Worked on a “Git for Word” project [1], which is currently on hold.

The diff part was manageable, though not trivial to get diffs that make sense for prose/regular text.

The hard parts are UX/UI (making Git concepts transparent to “normal” users) and merging. Yet without automatic merging, branching is not very convenient.

Would love to collaborate on this in the future again. Reach out if you are working in this space, happy to share.

[1] https://julesdocs.com

koolba · on March 2, 2022

I’ve had better than expected success with diffing word files by converting them to markdown via pan doc. It’s nowhere near perfect as you lose nearly all formatting, but if only the actual text content is changing it allows you to automate the display of those changes.

willis936 · on March 2, 2022

I don't think merging will ever be fully solved by software. It's a problem created and solved by process. How annoying merges are is entirely dictated by process.

Sourcetree is the best git GUI I've used. That could be used as a model.

I think an old-style solution to merging would be fine: output a word file that uses a unique font style to indicate which user made what conflicting changes, have the user edit the document and remove all of the "merge styles", then continue.

Seirdy · on March 2, 2022

Yeah, in high school I and another student did our essay peer reviews using annotated diffs once. It was so much easier than Google Docs, since we didn't have to leave our editors.

Wrapping version control with a non-tech-friendly porcelain could help a lot of people escape user domestication from vendor lock-in.

copperx · on March 2, 2022

I don't care how much porcelain you add. Using git effectively beyond add and commit requires undestanding quite a few concepts. Mass adoption of git in the consumer space ain't happening anytime soon.

greyhair · on March 2, 2022

It cracks me up that this many years later, you put five people in a room to establish a git branch/merge/release strategy, and you get five very different opinions, and such strong opinions they are.

Luckily, that responsibility is not in my wheelhouse, I just have to live with whatever decisions are made.

So yeah, mass adoption in the consumer space, I don't see that happening.

A revert or a rebase gone wrong and wails of "what happened to my file?"

Swizec · on March 2, 2022

> The number of files with names like “Report Final Final draft v3.docx” is truly staggering.

iirc this was the original pitch for Dropbox

> “Git for everything“ would be a multi-billion dollar startup easily.

Dropbox is valued at 8.5 billion

ovao · on March 2, 2022

Dropbox version history is limited to 30 days (free plan) or 180 days (paid plan)[1].

I like Dropbox, but for documents, 30 days worth of version history is not fantastic.

[1]: https://help.dropbox.com/files-folders/restore-delete/versio...

squeaky-clean · on March 2, 2022

The original pitch was "throw away your usb drive", I don't think file history came about until after 2010, but I'm not too sure and modern Google makes it impossible to find anything older than 2 years ago...

https://news.ycombinator.com/item?id=8863

slightwinder · on March 2, 2022

It's not that hard to search for old content on google. Just limit the year in the search filter. Anyway, I found mentions of this feature up to 2008 back. I guess it was there since mostly the beginning.

From Sep 15, 2008: https://www.maketecheasier.com/dropbox-backs-up-and-syncs-fi...

vishnugupta · on March 2, 2022

> “Git for everything“

Just to show how useful this is here’s Indian Constitution with amendments as commits:

https://github.com/anoopdixith/TheConstitutionOfIndia

Matrixik · on March 2, 2022

I mourn that StackEdit [1] got abandoned. It's online markdown editor that can use git as a backend. Fully cross platform editing (in browser) with synced all text. I used it with GitHub private repository for all my notes but editing on mobile was really buggy. So I moved to notion (unfortunately).

[1]: https://github.com/benweet/stackedit

pokstad · on March 2, 2022

NB does exactly this: https://github.com/xwmx/nb

anta40 · on March 3, 2022

Interesting, as much as I admire Evernote, I want something that is open source and can be installed locally. Of course, Git integration is another plus point.

dpcan · on March 2, 2022

I think my Dropbox is already basically this. It has a revision history of my text files I think.

dpcan · on March 2, 2022

Dropbox does this but it’s limited to a certain number of days, so not the exact same thing. I love the idea of a simple hit for everything.

yjftsjthsd-h · on March 2, 2022

> "Git for everything“ would be a multi-billion dollar startup easily.

AIUI, git is already prepared to be that, it just needs diff programs that can handle whatever format. I mean, other than git's interface being its own impediment to non-technical users.

loudmax · on March 2, 2022

"Report Final Final draft v3.docx" is in a proprietary format format controlled by a single company. The spec may be open in some sense, but it's far from a level playing field.

sgjohnson · on March 2, 2022

> is in a proprietary format format controlled by a single company.

It's not proprietary by definition. A nightmare to implement? Yes.

But definitely not proprietary.

_dain_ · on March 2, 2022

It doesn't follow the published open spec so it is de facto proprietary.

yjftsjthsd-h · on March 2, 2022

Right, but if that company released a diff tool for docx (or someone reverse engineered one), git would work with it. I don't expect that to happen, I'm just saying git isn't the limiting factor.

sgjohnson · on March 2, 2022

Pre-commit hook to convert to plaintext or markdown[0].

And there you go, a human readable diff.

[0]https://github.com/benbalter/word-to-markdown

conductr · on March 2, 2022

I feel like this is a solution without a problem. Or a problem without pain.

Most people have no issues navigating a full folder of revisions to get to “Report Final Final draft v3.docx” but anything resembling version control would simply be unused. At corporate level, the version features of box.net, egnyte, and others are rarely used. I'd say most people don't even know they can navigate the revisions until they are in a data loss situation and asking about how to recover a corrupted file (which occurs most frequently with Excel files in my experience)

kumarsw · on March 2, 2022

I wound up writing dupver https://github.com/akbarnes/dupver after getting frustrated with the lack of versioning tools for binary files. One neat thing about .docx files and their ilk is that they are "just" zip files so it isn't hard to add special handling to pull out their contents and run deduplication over that.

judge2020 · on March 2, 2022

> “Git for everything“ would be a multi-billion dollar startup easily.

Version control with online MS Office and Google Docs seems to be going pretty strong.

ho_schi · on March 2, 2022

git --everything-is-local

Which is the next thing. Git works just local and uses servers to sync. The purpose of servers is syncing and not depending on them. From this everything else comes, autonomous usage, speed, reliability, recovery.

deepGem · on March 2, 2022

"Git for everything" is the first thought I had when I opened this article. I am trying to keep all my notes in Google docs and it's all good as long as you stick to their format. I can't edit text files in Google docs ! I mean, how absurd is this.

I feel like building a Google docs clone for simple text files. Just one feature - versioning (A Time machine like interface) and perhaps add collaborative editing later. I just want to write and store a text document without having to create multiple files. Automatic versioning of snapshots so I can go back in time and refer to any timestamp.

jsnell · on March 2, 2022

Have a look at Etherpad.

chubot · on March 2, 2022

Yup, that's basically the same argument I'm making with this diagram -- with text, you can get useful operations "for free"

http://www.oilshell.org/blog/2022/02/diagrams.html#text-narr...

https://news.ycombinator.com/item?id=30483914

It's generally not something you want to reimplement ...

accrual · on March 2, 2022

> "Git for everything"

macOS Time Machine and Windows Shadow Copy aren't perfect git-based versioning systems, but it's nice something in the general direction exists.

https://en.wikipedia.org/wiki/Time_Machine_(macOS)

https://en.wikipedia.org/wiki/Shadow_Copy

m_eiman · on March 2, 2022

macOS also has support for local file history, even if many apps try to ignore these features: https://support.apple.com/sv-se/guide/mac-help/mh40710/12.0/...

When they introduces this they changed the standard file operations and replaced "Save as…" with "Save a copy" - this was not universally welcomed.

trevormeier · on March 2, 2022

> “Git for everything“ would be a multi-billion dollar startup easily.

Didgets (recently surfaced here on HN) seems like a sane approach to this, from the file system up. Pretty incredible performance too.

https://didgets.substack.com/p/where-did-i-put-that-file

Sadly not (yet) open source, though the developer is considering it.

chupchap · on March 2, 2022

Isn't that what Google Docs does already?

ryanqian · on March 2, 2022

I like this too, easily share is really a huge plus for any type of format text.

brightball · on March 2, 2022

One of the big things I love about Dropbox is automatic version control of the file.

Google docs has a similar feature built in too I think.

innomatics · on March 2, 2022

I switched from dropbox to mega years ago when the former removed Linux support. TIL mega have version support so I need to enable that and try it out.

First thing I do on a new device is setup my mega shared drives - and there is a lot of plaintext files there!

slightwinder · on March 2, 2022

Dropbox never removed linux-support at all, or sync specifically. They removed support for some esoteric filesystems which lacked certain features. For most people dropbox on linux did not stopped working.

brightball · on March 2, 2022

Been using it on Linux for many years but it does feel like their Linux support has gotten worse.

orhmeh09 · on March 2, 2022

Dropbox removed Linux support? It seems to work fine for me on arch Linux with btrfs.

innomatics · on March 2, 2022

It was sync specifically, and four years ago. Maybe it works now.

https://news.ycombinator.com/item?id=17732912

10729287 · on March 2, 2022

If I remember well, they removed the ability to host Dropbox folder on a encrypted drive.

nly · on March 2, 2022

It works well if the text format in question is line oriented, otherwise it doesn't.

rmbyrro · on March 3, 2022

I thought Google Doc had solved this with single-version model and automatic change history with easy restoration.

niutech · on March 3, 2022

As for "git for everything", try SparkleShare or LogicalDOC CE.

nuker · on March 2, 2022

But can you use git for the .docx files?

sgjohnson · on March 2, 2022

You can, but .docx is binary (it's .zip, and the Word document itself is OpenXML)

So you won't have a neat revision history, unless you implement something in the pipelines to also convert .docx to something human readable, because .docx and OpenXML isn't.

elteto · on March 2, 2022

This is one of the main strengths of Obsidian, which I have been using lately and I’m extremely happy with: everything is just Markdown files in some folder on disk. Zero danger of lock in.

You can have a disk hierarchy if you desire, or trust that search will find you what you are looking for. For me search works fairly well and if I need something extra I can just grep/sed/awk.

Plus it has the features that a simple text editor will not have: displaying images, live preview, relationships between topics, etc. It even has a Vim mode!

nextos · on March 2, 2022

I use org-mode the same way. I have a flat directory with many small files, inspired by Zettelkasten and by simple wikis. No server, no dependencies.

This also works well for small organizations. Both GitHub and GitLab are able to render org files, with some minor limitations. Hence, a simple repository full of org files is already a wiki. Even local links just work, with zero dependencies.

It's possible to transform org files into HTML with some CI task to get support for all org features. But I have found this a bit of an overkill. I don't need most org features. The basics are already great: outlines, timestamps, hyperlinks, tables and footnotes.

You can achieve more or less the same things with Markdown as well.

rileyphone · on March 2, 2022

There’s also org-roam, which introduces the bi-directional links that systems like Obsidian have. Just make sure you’re on v2.

This is truly the era of the memex.

DerArzt · on March 2, 2022

The only problem with org (which may not be a problem for some folks) is that if you want to be able to edit org files with the least friction you need to use Emacs. I tried the org extension for VS Code and vim in the past and they just weren't quite the same as editing org via emacs.

sylens · on March 2, 2022

This is exactly why I switched to Obsidian after trying a few others like Bear, Notion, and Craft. Let me see the files on disk! Don't abstract it away or hide it from me.

awill · on March 2, 2022

It's just unfortunate that the proprietary apps have IMHO much nicer UI/UX. Why can't we have both!

On Mac I use Ulysses. I get the benefits described here while still having a fast, native app with excellent UX.

I have Mac for work, and Linux for personal and an Android phone. That means Ulysses on Mac, ThiefMD on Linux, and Markor on Android, all synced with syncthings. Native apps on different OSes is the best solution I've been able to come across.

A single cross platform app would be preferable, but the last time I used Obsidian it was slow, heavy, bloated, with a pretty ugly UI, likely in part due to running on electron.

dmje · on March 2, 2022

The look of Obsidian put me off too but after I got into it I realised the whole theme is totally css-able. It's really easy to make it your own.

I do agree though, they do themselves no favours with that ugly black / purple website. They need a much better, much prettier default.

All that aside, I'm delighted with my switch to Obsidian. Just an amazing tool, and total reassurance that it's just markdown with all the benefits that brings. So glad to be rid of Evernote...!

FireInsight · on March 5, 2022

I've always liked Obsidians design or/and haven't thought of it as bad. We've all seen way worse looking applications.

D13Fd · on March 2, 2022

I use Ulysses too, but I it could be faster. I haven’t tried them in the latest versions, but in the past I’ve had issues with even medium-sized files with lots of formatting.

cytzol · on March 2, 2022

Serious question: how do you square your third paragraph with your first? That is, if you're using Obsidian's features like [[square bracket link syntax]], or #tags, or inline images, aren't you effectively locked in to editors that support the same set of features?

kemayo · on March 2, 2022

Not really -- you're only locked in to the extent that Obsidian makes those things easy

If Obsidian went away, I'd still have a bunch of text files that I understood, and I'd know that I could find things tagged with #foo by using grep, that the [[links]] just mean to open links.md, that an image should be opened in a browser, etc.

ovao · on March 2, 2022

Additionally, it’s pretty to extend most Markdown parsers with extended syntax. The worst thing that happens is you have to fork a parser to add extended syntax…which isn’t so bad.

runjake · on March 2, 2022

It’s still all plain text.

Best practice for tags is just include them in the plain text file, and inline images in Obsidian is like Derek said — just keep them in your folder structures just like your text.

vertis · on March 2, 2022

square bracket syntax is fairly well used in wikis. While there are varying degrees of user experience if you step away from Obsidian, it's still readable. You can go and convert the links to []() either inside Obsidian with a plugin or later with a script.

You'll be able to get access to it in 20 years, even if not in a shiny UX friendly way. Which is more than can be said for some of the really old notes I took in proprietary pieces of software.

disruptthelaw · on March 2, 2022

And easy linking and bi-directional linking. Once you experience these two features it’s very hard to imagine notes without it. That’s the main reason I couldn’t go with the authors workflow of just plain text files.

wodenokoto · on March 2, 2022

I do wish they had a cheaper commercial tier that maybe didn’t offer priority support or something.

$50/year puts you next to some good developer tools, that already do markdown pretty well, and I just don’t see the point of writing notes if they’re not also for work.

I’m sure it’s worth it once you have 100s or 1000s of notes by i somehow feel like it’s a bit of a too big pill to swallow in the beginning.

hammyhavoc · on March 2, 2022

You can use it free, mate. Just sync your markdown files via your favourite provider, or your own WebDAV, e.g. Nextcloud, or use a free sync plugin. There's even git plugins if you're a fellow nerd that wants version control.

wodenokoto · on March 2, 2022

According to the license, you can't use it commercially without the commercial license, and I think the license is very clear that work related notes are included in commercial definition:

> You need to pay for Obsidian if and only if you use it for revenue-generating, work-related activities in a company that has two or more people. Get a commercial license for each user if that's the case. Non-profit organizations do not need commercial licenses.

https://obsidian.md/eula

pfix · on March 2, 2022

But then I consider $50 not that much for a software that helps you earning your salary. And it becomes tax deductable this way.

If it doesn't help your work or a free markdown editor provides you the same value, you have your answer.

As I use it for my job I consider it worth the 50 bucks. I still would prefer it to bee FOSS, though (and be willing to pay, anyway) - as then I knew even the software might be still around when the company behind it is gone.

A lot of utility with obsidian comes from the plugins from the community (e.g. excalidraw) and even though they might store their stuff also on your disk as text files, the utility of that eco system is gone as soon as one switches (or changes, or requires a different file format for the text files or whatnot)

hammyhavoc · on March 2, 2022

Interesting!

Temporal_Trout · on March 2, 2022

Obsidian is great. I use Syncthing to share my "database" between all my devices. I can do some quick edits/take notes on the go and when I get home the changes are already mirrored to my desktop.

tdhz77 · on March 2, 2022

Been using obsidian for two weeks and the tagging / linking to documents is top notch, saving it to my internet drive at work brings out the best of all worlds

merdaverse · on March 2, 2022

Obsidian looks interesting, but how do you sync between devices? Their sync service seems really expensive. On desktop I guess you can use git/Dropbox plugin, but what about mobile? I take a lot of notes on my Android phone.

awill · on March 3, 2022

I currently sync with FolderSync on Android. That syncs a Google Drive or Dropbox folder to your phone. Then use that folder with Obsidian

rattray · on March 2, 2022

I love Obsidian, but I'm now beginning to move many of my documents to Notion for sharing with a team.

I really wish Obsidian, or something like it, worked well for teams. Maybe it's on their roadmap, I'm not sure.

prashantsengar · on March 2, 2022

Have you tried Athens Research? [0]

It is open source and everything is stored locally but not in markdown files. You can self-host the desktop app to share notes.

[0]: https://github.com/athensresearch/athens

hammyhavoc · on March 2, 2022

I went to visit https://athensresearch.github.io/athens/ and got "Only browsers based on Chrome/Chromium are supported" on Firefox. Lol.

D13Fd · on March 2, 2022

Wow, iOS Safari gets the same message. Interesting choice…

rattray · on March 2, 2022

That doesn't seem designed for growing teams – for example, it doesn't seem to have comments.

prashantsengar · on March 2, 2022

They are still in beta currently, focusing on small teams right now so there is hope for comments.

ParetoOptimal · on March 2, 2022

How is notion? I heard it was slow?

Also, is real time collaboration a game changer or a gimmick?

I used to think the former, but was questioning it recently.

What would be lost with syncthing + obsidian for your team or maybe something that does autocommit like fossil for probable conflicts?

lufasz · on March 2, 2022

It's still painfully slow, and the WYSIWYG editing is very clunky.

The real time is not as real time as, say, Google Docs. There's a lot of lag.

rattray · on March 2, 2022

It's faster now, I haven't been bothered by speed yet.

But yes, realtime collaboration is a critical feature for documents that can be used during meetings (and IMO ~no meetings should be held without documents).

The other critical feature is comments, which I assume Obsidian does not have support for (they would be very nontrivial to represent in markdown).

rattray · on March 6, 2022

For posterity, I got pretty annoyed by the clutter of "Type / for commands" and stuff, and used this Stylish theme to clean things up a bit:

https://gist.github.com/rattrayalex/2e4f934045aabc4caeeab249...

keithnz · on March 2, 2022

yeah, likewise, I have been using markdown and folders of files for ages, started to use Obsidian and it provides a nice editor for that, and yes, it even has a vim mode!

xenodium · on March 2, 2022

Plain text adoption often implies markdown for a richer experience, but we also have the wonderful https://orgmode.org markup.

There is no shortage of markdown-based tools on all platforms. Our org markup options, on the other hand, are very few outside of Emacs. Org markup itself is super versatile and can power lots of use-cases.

I built two org-powered apps for iOS myself:

https://plainorg.com

https://flathabits.com

There are other great ones out there:

https://beorg.app

https://logseq.com

https://organice.200ok.ch

https://orgro.org

http://orgzly.com

Lastly, a shoutout to Karl Voit who's been driving org markup awareness outside of Emacs with Orgdown https://gitlab.com/publicvoit/orgdown. He's also discussed org markup's strengths at https://karl-voit.at/2017/09/23/orgmode-as-markup-only

altgans · on March 2, 2022

I've been intensively using Orgmode for a year and a half (wrote my thesis in it) and then abandoned it to switch to Markdown.

Orgmode quickly turns into not-quite-plaintext with humanly impossible to read and very distracting data structs stuck inside the text. I think these were called "Properties"?

For me the beauty of plain-text is that it can be read and written in any Editor without needing syntax highlighting. Here Orgmode fails for me, as I found it unreadable and unusable without an Emacs-esque toolkit.

ubermonkey · on March 2, 2022

Yeah, I agree. I use org, but if I'm writing a document I'm in Markdown, not the Org markup. It's just noisy.

Emacs partisans are often very, very sure that the emacs way is always the best way, sort of like evangelicals. (And I say this as a guy who has emacs and orgmode open all day, every day.)

seanw444 · on March 2, 2022

That's a fair gripe, as an Org-mode+Emacs user. I certainly see the clunkiness you experience in its structure. But for me, I find it highly unlikely I'll ever use an editor other than Emacs for anything but basic and quick editing. For things as comprehensive as to necessitate writing in Org-mode, I'd rather do it in Emacs anyways. Need to port it to another editor? Two options: Org-export it to a more portable format, or get started on a comprehensive integration for Org-mode in the new editor.

I personally would be willing to put the time into developing Org-mode functionality on-par with Emacs, in a new editor, if the new editor were to have major advantages over Emacs. But I don't think I'll come across one for a very long time.

Siira · on March 4, 2022

The kind of “data-rich” org file you don’t like would presumably be impossible in markdown. So what are you gaining at all? Just don’t write such data-rich org files. That’s totally under your own control.

Aardwolf · on March 2, 2022

I wouldn't call those plain text, if it doesn't keep your newlines by default, it's not plain text but a markup language like HTML

BBCode is nice and keeps newlines, but something that uses markdown's syntax for bold and titles, but doesn't remove newline characters, would be ideal

urlwolf · on March 2, 2022

Don't forget asciidoc. Coherent, has markup for things like video etc. It's a superior format to markdown, a pity it's so unknown

maleldil · on March 2, 2022

Why do you think it's superior to Markdown?

mrpotato · on March 2, 2022

I was also interested in the answer to your question. Here's[1] why AsciiDoc thinks it's better than Markdown.

[1]https://docs.asciidoctor.org/asciidoc/latest/asciidoc-vs-mar...

atweiden · on March 2, 2022

As someone who lives in a terminal, I find @junegunn’s “journal” markup format a lot more pleasing to the eyes than Markdown — holy kaleidoscopic colours, batman. Bonus: in a weird way, the colours of journal trained me to love lisp.

[1]: https://github.com/junegunn/vim-journal

tveyben · on March 2, 2022

@xenodium Wow - impressive reviews for your ‘plain org’ app.

When I grow up with my eMacs and org usage (I’m only playing now as org is new to me - but I know it’s what I have been missing!!!) I for sure must give your app a shot for iOS usage

xenodium · on March 2, 2022

Nice to hear. Thank you. Enjoy your emacs + org journey! It’s quite a fun ride.

jerrygoyal · on March 2, 2022

> There is no shortage of markdown-based tools on all platforms

Could someone name one decent md editor for Android?

Madeindjs · on March 3, 2022

[Markor](https://github.com/gsantner/markor), free, open source and available on F-Droid.

dm319 · on March 2, 2022

And vimwiki!

jasode · on March 2, 2022

>If you rely on Word, Evernote or Notion, for example, then you can’t work unless you have Word, Evernote, or Notion. You are helpless without them. You are dependent.

Although I deliberately avoid Evernote and Notion for those stated reasons, I'm fine with Microsoft Word. I've been using it for 20 years. It even opens my proprietary DOS WordPerfect ("*.wp") files from 1980s!

Sometimes I need more flexible layout of fonts and graphics and MS Word is the tool I use for that. Markdown isn't an alternative. (EDIT add: Markdown isn't powerful enough for the style of documents I write. E.g. MS Word has tools to overlay graphic elements like arrows and callouts floating as editable layers on top of screenshots. Markdown can't do that.)

I'm confident I can still use it as a tool decades into the future. Even if Microsoft became more user hostile and eliminated local install in future versions and required a cloud-only Office 365 account, I'll just keep using Word 2019. LibreOffice may also be a Plan B option to open docx files. Not nervous about MS Word files at all.

chrisweekly · on March 2, 2022

"Markdown isn't an alternative."

Strongest possible respectful disagreement.

--- ^ orig comment above, edits below (I was interrupted when 1st commenting; apologies to anyone who responded to the short version) ---

See eg Obsidian (https://obsidian.md) for a great example. It's astonishing how feature-rich it's become. WYSIWYG editor, Excalidraw integration, Slid.es, etc etc etc.

And if you have any webdev chops at all, you have access to HTML, CSS and JavaScript. It's not even close.

_ofdw · on March 2, 2022

Markdown is actually awful. There's at least 3 different dialects that are all called Markdown, so if you want your text rendered it's a toss-up as to whether or not it'll look how you intend.

I think that Markdown's success is due to a lack of a widely-used, simple alternative that's well-specified.

Org is easily and objectively the best markup but of course it's only very very recently that there was a non-emacs implement worth a damn.

Seirdy · on March 2, 2022

Then pick a standardized dialect. Commonmark is the lowest common denominator; GFM is useful for when you need some extra features. Markdown also allows HTML for whenever you need something fancy.

The advantage of markdown isn't robustness, it's simplicity. Preteens can get a handle on it in minutes when using Reddit.

The main benefit that all these markup-shorthands serve IMO is getting people to add meaning semantically rather than with presentation. Word processor users typically adjust font size and color to make headings, even when given a list of headings right in the ribbon. Markdown is just an HTML shorthand: you can only work with semantic meaning.

_ofdw · on March 2, 2022

>Markdown also allows HTML for whenever you need something fancy.

Indeed and therein lies what I view is Markdown's worst feature.

Groxx · on March 2, 2022

While I broadly agree, particularly because it feels so fuzzy about when it starts interpreting it as raw HTML and when it goes back to markdown... I'm not sure how you can really claim Org is unambiguously better (best!) when Org mode has multiple flavors of almost exactly the same kind of feature. E.g. inline HTML in Org is: https://orgmode.org/manual/Quoting-HTML-tags.html#Quoting-HT...

    @@html:<b>@@bold text@@html:</b>@@

Or inline LaTeX, which is arguably much worse than markdown's HTML fuzziness: https://orgmode.org/manual/LaTeX-fragments.html#LaTeX-fragme...

>To avoid conflicts with currency specifications, single ‘$’ characters are only recognized as math delimiters if the enclosed text contains at most two line breaks, is directly attached to the ‘$’ characters with no whitespace in between, and if the closing ‘$’ is followed by whitespace, punctuation or a dash.

If I understand that correctly, it means this is LaTeX:

    $a^2
    +1
    =b$

But this is not:

    $a^2
    +1
    +2
    =b$

Similarly, this is apparently an inline chunk of LaTeX: https://travel.stackexchange.com/questions/39527/which-count...

    USA tends to use $123 to show dollar amounts,
    but e.g. Germany apparently sometimes uses 123$.

approxim8ion · on March 2, 2022

I get that Markdown is expected to be rendered into HTML eventually, but I find the syntax useful in its raw form too, just for formatting text files.

ubermonkey · on March 2, 2022

>Org is easily and objectively the best markup

I laughed.

tdhz77 · on March 2, 2022

Most of these differences are managed rather easy.

ddulaney · on March 2, 2022

> Sometimes I need more flexible layout of fonts and graphics

That’s the key. For much of my writing, all I need is text with a little formatting, and markdown is great for that as a user. (It’s less nice if you’re writing a parser, but that hasn’t come up for me yet :) )

For some of my writing I need professional-grade layout. There I either use LaTeX directly, or I use pandoc then LaTeX.

But sometimes I need to:

- Include images that don’t have a public URL

- Use more than one font

- Have a recipient be able to edit the document

- And the recipient isn’t a developer

Markdown doesn’t check all of those boxes. Word files do (as do Google Docs links, ODF files, and some others).

Markdown is great, but it’s not enough all of the time.

ubermonkey · on March 2, 2022

Asking people to become web devs to get the kind of formatting Word makes trivial is not a winning argument.

chrisweekly · on March 3, 2022

Agreed. It's also not the argument I was making.

falcolas · on March 2, 2022

HTML works quite well for images and text placement. Latex (and its derivatives) work even better.

Word isn't bad, so long as you keep paying your subscription.

jodrellblank · on March 2, 2022

> "Word isn't bad, so long as you keep paying your subscription."

Web Word is free, like Google Docs is. (Meaning, it costs your credentials and data rather than your money).

https://www.techradar.com/uk/how-to/how-to-download-and-use-...

https://www.microsoft.com/en-us/microsoft-365/free-office-on...

CRConrad · on March 3, 2022

> Web Word is free, like Google Docs is. (Meaning, it costs your credentials and data rather than your money).

For now. You don't know if and when Microsoft's business model is going to change.

emptyparadise · on March 2, 2022

I disagree with the whole "NEED VISUALS OR GRAPHICS?" section. I do need visuals! I do need graphics! I'm not a typewriter, why must my flow be interrupted with me having to leave my document to open some JPEG file whenever I need to show something that isn't text?

It's not like this is crazy difficult or requires proprietary apps either. Writing a modern browser may be an impossible feat but I'm certain that it would take just a day or two to get a very simple barebones HTML renderer which supports <img> and a couple of formatting options.

At this point I wish we had a tty which supports variable width and size text and inline images and video. Then I could be happy.

jodrellblank · on March 2, 2022

It's 2022 and people like the blog author are all "who needs screenshots? who needs audio? who needs colours? who needs fonts or styles? who wants to see their handwriting strokes? who needs metadata? I use text because it's 1970!".

Hello? 4K video in my pocket, structured data streaming out of every system around me, all I'm supposed to want to capture is plain text? What a paucity of information, of detail, what an absense of dreaming, what a position taken purely from fear - never have nice things because you might lose them one day!

cobbaut · on March 2, 2022

So when you need to remember a user/password combo for a new device, offline doorbell maybe, or a parcel number, or the address of a friend, then you record a 4K video instead of writing it in a text file?

Plain text just works, everywhere, all the time.

qayxc · on March 2, 2022

> Plain text just works, everywhere, all the time.

No it doesn't. Try a plain text description of an electronic circuit vs a circuit diagram.

Try a a plain text instruction sheet vs an illustrated one.

Try plain text sheet music vs actual sheet music.

Try a mathematical plot vs plain text tables.

Try concept drawings of product vs text descriptions.

Try construction drawing vs plain text descriptions.

I could go on and on.

Narrow views on the world result in flawed thinking and absolutes that are just plain wrong.

the_other · on March 2, 2022

Some good points here. In defence of the original point, you could extend the "plain text everywhere" to "open formats, as text-based as possible" and keep the spirit. I'm personally in the "markdown for most things, with inline images" camp (with regards my own notes).

This one caught my eye tho:

> Try plain text sheet music vs actual sheet music.

Conceptually, "sheet music" *is* a plain text of music.

I accept that technically it would need to be implemented very differently as we haven't got ASCII or UNICODE for sheet music (AFAIK). You could probably do it with an XML language, or at a stretch YAML, and it'd be close human-readable and still vaguely plain text.

morganvachon · on March 2, 2022

> No it doesn't. Try a plain text description of an electronic circuit vs a circuit diagram.

Maybe it's just me, but I do this at work all the time. I create build sheets out of just plain text before I fire up the circuit simulator, so I've got a good mind map of it and something to refer back to. Granted I'm not dealing with advanced circuits, it's usually industrial machinery and motor control, or at most a simple digital timer circuit, but I do just fine with plain text for something highly technical.

sicariusnoctis · on March 2, 2022

> No it doesn't. Try a plain text description of an electronic circuit vs a circuit diagram.

I'm not an EE, but there's SPICE for regular electronic circuits and SysVerilog et al for logic circuits. Math can be typeset in LaTeX, though its plain text readability depends on the formula. Category theory diagrams can be written in a simple language. Sheet music is some sort of XML, though I imagine there could be a more human writable/readable plain text format for this too. There was also a recent post about Markdown diagrams. And of course, code is still entirely text.

Much of what you mentioned can be described in a concise, portable text-based language. Though these documents are readable as text, they can be further rendered for best readability. Luckily, many can also be live previewed.

qayxc · on March 2, 2022

> there's SPICE for regular electronic circuits and SysVerilog et al for logic circuits

Both are inputs for programs, not actual representations for humans to work with and -understand. Try and design or better yet fix actual hardware with a SPICE netlist or Verilog program, good luck with that. Different use cases require different representations and the world doesn't just consist of software.

> Category theory diagrams can be written in a simple language

Category theory diagrams are not curves or function plots.

I get the feeling that you are confusing file formats with representation. If I need external software to actually render the input (which can be binary or text for all I care) into a human-readable and -consumable format, what good is text?

What's the "readability" of an SVG, LaTex, or worse - MS DOCX - document?

How is this:

  Example netlist
  v1 1 0 dc 15
  r1 1 0 2.2k
  r2 1 2 3.3k     
  r3 2 0 150
  .end

even remotely comparable to this: https://sub.allaboutcircuits.com/images/01004.png

And again, I'm not talking about document formats for software - I'm talking about data representation for humans.

Siira · on March 4, 2022

> Try a plain text description of an electronic circuit vs a circuit diagram.

It’s called a Hardware Description Language. According to my textbooks, they’re the industry standard. My personal experience certainly prefers them to unusable “diagram” drawing GUIs.

> Try a mathematical plot vs plain text tables.

You can do plots in unicode. And more importantly, the code that produces the plot is plain text.

Pretty much anything regular can be done better with a plain text source that subsequently compiles to more visual mediums.

qayxc · on March 4, 2022

> It’s called a Hardware Description Language. According to my textbooks, they’re the industry standard

So when confronted with a faulty circuit board with multimeter in hand and bench power supply on stand-by, you are looking at VHDL like this

  Library ieee; 
  use ieee.std_logic_1164.all;
  
  entity mux is
     port(S1,S0,D0,D1,D2,D3:in bit; Y:out bit);
  end mux;
  
  architecture data of mux is
  begin 
     Y<= (not S0 and not S1 and D0) or 
        (S0 and not S1 and D1) or 
        (not S0 and S1 and D2) or
        (S0 and S1 and D3); 
  end data;

to figure out which components might be broken when there's a buck converter that gets no power or where the capacitor is located that should smooth out incoming voltage on a certain bus. Yeah, sure.

> You can do plots in unicode. And more importantly, the code that produces the plot is plain text.

So you're one of those people who don't listen to electronic music and prefers to read the parameters of the VCAs, LFOs and VCFs instead? After all it's just code that produces the sound waves in the end, no?

> Pretty much anything regular can be done better with a plain text source that subsequently compiles to more visual mediums.

And again we're back to data storage vs representation. I'm interested in the representation, not the storage format.

If you're one of those geniuses that can squint at a quintic formula and say - yep, there's a local maximum at about x=-9.7513, more power to you. The rest of us prefer a curve.

seanw444 · on March 2, 2022

This is where Org-mode succeeds over Markdown. Inline image support adds a whole new level to plaintext markup.

Sure, you could implement an inline-image-viewing ability into your editor for Markdown, too, and I'm sure someone's done that for Emacs (maybe it's a default feature I haven't enabled), but that's the vanilla functionality for Org-mode. Not a secondary-layer afterthought.

lkxijlewlf · on March 2, 2022

Plain text can be used to generate those. Anything other than plain text is merely a visualization.

qayxc · on March 2, 2022

Which makes you require software again, so what's the point exactly?

A cool idea for a logo stored as SVG cannot be visualised by a human, neither can text data rows of a CT scan or a stress visualisation from an FEM or FEA simulation.

What you call "merely a visualisation" in many cases is the data you (as a human being) are actually interested in; the storage format is an implementation detail and I'm very surprised how this clear distinction seems to be ignored by some.

jodrellblank · on March 2, 2022

I often see a serial number on a device and take a photo rather than typing on a phone keyboard. Or use software which blocks clipboard functionality and take a screenshot instead of typing it out, yes. OneNote does character recognition on all images and makes them searchable. If a doorbell or key safe has a keypad and I need my computer to look up the code, what difference does it make if the code is text or photo of scribble? Because one day my computer might forget how to open bitmap files? How likely is that? The safe will have turned to dust before all software which can read JPGs is lost to history).

> "Plain text just works, everywhere, all the time."

My point is not "text doesn't work", but it would be good if other things worked that well and there's no good reason why they can't. You don't eat plain oatmeal for every meal because it works, and shun restaurants because one day they might close. You don't plug your ears because you can read lyrics and sheet music and one day you might go deaf. This is like the "first class functions" of programming. Wanting first class functions is not saying that plain loops don't work, it's saying that more powerful things are desirable even if they aren't cross-language compatible.

IshKebab · on March 2, 2022

Obviously not. Nobody is saying that plain text is never the right solution. Just that it isn't always the right solution.

benhurmarcel · on March 2, 2022

What about a note of the dimensions of a room, furniture, or piece of equipment? Suddenly you need a diagram.

mro_name · on March 2, 2022

I find Edward Tuftes reflections on powerpoint vs. engineer-prose very eye-opening.

https://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=...

Prose takes time, contemplation and thought. Leslie Lamport says similar.

Helps with understanding however not so much with selling.

murermader · on March 2, 2022

I mean you could also just use Markdown. Put the image in the same folder, so you don’t loose it, and then use a relative link from your text file.

If you are using a markdown viewer it should render nicely, if you are using a text editor there is just a path where the image should be, so also no problem.

Some editors like Typora can be configured to automatically save images next to the markdown file, which is really helpful.

Dave3of5 · on March 2, 2022

> but I'm certain that it would take just a day or two to get a very simple barebones HTML renderer which supports <img> and a couple of formatting options.

https://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect

This tells me you've never actually tried this. For example the img tag allows you to, amongst other things, render an svg.

If you want a more full spec you can see here what it takes to render an img element:

https://html.spec.whatwg.org/multipage/images.html

smasher164 · on March 2, 2022

> For example the img tag allows you to, amongst other things, render an svg.

I don't think the GP meant "write an image renderer" when they said "get a very simple barebones HTML renderer". Most languages have libraries for encoding/decoding images in various formats, so aside from how it would fit into document layout, this is a non-issue.

emptyparadise · on March 3, 2022

What I said right before the part you quoted was "Writing a modern browser may be an impossible feat" and the part you quoted talks about a "barebones HTML renderer". What I'm describing is labels and images in a VStack. This is something any operating system UI toolkit (open source ones too!) can do.

gatonegro · on March 2, 2022

> Every few years a new company says you should use their special format. You have to pay them a monthly fee to use it — or keep all of your documents in their care. [...] When you store your writing in one company’s unique format, then you need that program to access it. Then the economy takes a turn, they go out of business, and your work is trapped in an unusable format.

This is something I've been dealing with lately. I have over a decade's worth of important personal and work files in proprietary formats, since I used Windows and Adobe's stuff for a long time. These days I'm using Linux exclusively, so my only option to read many of those formats is to hope that things like GIMP and Inkscape are able to open them somewhat accurately. All the documents I created in InDesign are as good as gone. I still have the files, but they're locked behind proprietary software that I couldn't use even if I wanted to. Oh, well...

For creating documents I've been learning LaTeX, and for note-taking I quite like Vimwiki[1]. I don't do nearly as much graphics work as I used to, though I appreciate things like the fact that Inkscape uses plain SVG by default. I'd rather not be locked out of my own files again in the future.

[1] https://vimwiki.github.io/

iLoveOncall · on March 2, 2022

> they're locked behind proprietary software that I couldn't use even if I wanted to

Hum, yes you could. The formats aren't dead.

It's like putting on a blindfold and saying you couldn't see even if you wanted. Just remove the blindfold.

Just install Windows or Mac.

wolverine876 · on March 2, 2022

I use plain text because there is no multimedia equivalent that is universal and end-user controlled, and there are no multimedia equivalents to Vim, Grep, regex, etc. Text is great, but an image, table, or diagram are sometimes irreplaceable.

We think we are great innovators, but we are stuck in 1990. Notice also that there are no Vim-equivalents for phones, even though that platform has been widely used for 15+ years.

Sorry Bill Joy; sorry Bell Labs; sorry (many more); we let you down, and we let down the public, which now has no integrated multimedia format beyond WordPress or Instagram.

syntheweave · on March 2, 2022

I think we're right on the precipice of this changing. PDF has long been the preferred format for dead-tree-like documents. It isn't quite as open, cleanly designed or navigable as we'd like but it has the inertial force to continue staying in use for decades.

Lately I've been working with Inkscape and realized that SVG also has the capacity to be a freeform document tool, and one with a different default context than HTML(scrolling document) or PDF(paginated document). The default setup involves a canvas, it has a notion of object hierarchies that lets you browse content from an outline, it lets you embed assets and layout in a more-or-less arbitrary fashion, it can be coded with JS, and there's some (under-explored) possibility for structured accessibility. It just needs a viewer that approaches the format in a way that makes sense as a target for content creation, beyond its existing role of small vector icons and simple animations.

None of the existing formats have to work exactly right, they just have to be the starting point for a streamlined source; then make a direct viewer for the source and you have the integrated multimedia format.