Jd – JSON Diff and Patch

_flux · 2024-09-09T10:48:00 1725878880

I realize it's nice to use short names for applications, but couldn't this also be have been called jdiff? I feel like the two-letter tool name is exercising developer memory more than strictly required. I have 42 two-character binaries in /usr/bin. Of course, that's still only about 5% of the available two-alphabet names..

The developer can always choose to use a shorted local alias for commonly used tools.

That being said, I wonder if this is much better than difftastic that is more general purpose, but tree-aware? I suppose this one wouldn't care about JSON dictionary key ordering, at least.

keybored · 2024-09-09T10:55:39 1725879339

The real headscratchers are the tools that have a proper name, a shortened name, and a command name:

- Stacked Git

- Shortened: stgit

- Command: stg

Lots of “stgit: command not found” ensues.

mejutoco · 2024-09-09T14:17:12 1725891432

You have a point of course, but I find it funny that the path is not /users/binaries and, instead, it is a similar abbreviation.

In a way, it is a sort of seo race for tool devs.

spencerchubb · 2024-09-09T14:32:15 1725892335

usr stands for user system resources

Tsiklon · 2024-09-09T15:11:28 1725894688

This is a backronym. /usr is the original user home directory location on classic unix.

https://www.bell-labs.com/usr/dmr/www/notes.html

conkeisterdoor · 2024-09-09T14:51:09 1725893469

TIL after so many years that /usr isn't an abbreviation of "user". "UNIX/user system resources" makes a lot more sense in retrospect. Guess I should have RTFM a long time ago!

dunham · 2024-09-09T15:14:04 1725894844

It looks like that's a newer interpretation than the original:

> As such, some people may now refer to this directory as meaning 'User System Resources' and not 'user' as was originally intended.

https://tldp.org/LDP/Linux-Filesystem-Hierarchy/html/usr.htm...

josephburnett · 2024-09-09T17:19:21 1725902361

> couldn't this also be have been called jdiff? I feel like the two-letter tool name is exercising developer memory more than strictly required.

Yeah, in retrospect I should have given this a longer name. I was going for a natural fit with `jq`. ¯\_(ツ)_/¯

> I wonder if this is much better than difftastic that is more general purpose, but tree-aware?

There are quite a few good tree-aware JSON diff tools out there. But I wanted one that could also be used for patching. I've tried to maintain the invariant that all diffs can be applied as patches without losing anything. And I also wanted better set (and multi-set) semantics, since the ordering of JSON arrays so often isn't important.

bugtodiffer · 2024-09-09T14:15:56 1725891356

Just use gron! greppable json

It turns JSON to JS syntax. it"s perfect for these tasks.

https://github.com/tomnomnom/gron

Cu3PO42 · 2024-09-09T06:28:03 1725863283

I have recently used jd with great success for some manual snapshot testing. At $work we did a major refactor of $productBackend, so I saved API responses into files for the old and new implementation and used jd (with some jq pre-processing) to analyze the differences. Some changes were expected, so a fully automatic approach wasn't feasible.

This uncovered a few edge cases we likely wouldn't have caught otherwise and I'm honestly really happy with that approach!

One thing I would note is that some restructurings with jq increased the quality of the diff by a lot. This is not a criticism of jd, it's just a consequence of me applying some extra domain knowledge that a generic tool could never have.

josephburnett · 2024-09-09T17:22:17 1725902537

> I would note is that some restructurings with jq increased the quality of the diff by a lot.

I would really like to know more about these restructurings. Would you mind dropping me an example here or at https://github.com/josephburnett/jd/issues please? There are somethings I won't do with jd (e.g. generic data transformations) but I do plan to add some more semantic aware metadata with the v2 API.

Also, I'm glad this tool helped you! Made my day to here it :)

zachromorp · 2024-09-09T09:41:21 1725874881

Hello! I sometimes have big json files to diff. Its content is a big array with complex object inside. The problem I have with all the diff tools I tried (this one included) is that it can't detect if element is missing. When that happens, it computes a very long diff where it could have just said "element is missing at index N". Are you aware of a tool without such caveat ? Thanks

josephburnett · 2024-09-09T17:26:42 1725902802

> When that happens, it computes a very long diff where it could have just said "element is missing at index N".

That's exactly the problem addressed by this issue: https://github.com/josephburnett/jd/issues/50. And I've created a new v2 format to address this and other usecases. The v2 API will compute the longest common subsequence of two arrays and structure the diff around that (a standard way of producing a minimum diff).

I've just released jd 1.9.1 with the `-v2` flag. Would you mind trying one of your use cases to see if the diff looks any better? I should say something exactly like that "@ (some path) - (some element)".

eequah9L · 2024-09-09T14:35:23 1725892523

I'm probably missing something obvious, but diff seems to be handling this just fine?

    # diff -u <(echo '[{"a": "b"}, {"c": "d"}, {"e": "f"}]' | jq) <(echo '[{"a": "b"}, {"e": "f"}]' | jq)
    --- /dev/fd/63 2024-09-09 16:31:23.376841575 +0200
    +++ /dev/fd/62 2024-09-09 16:31:23.376841575 +0200
    @@ -3,9 +3,6 @@
       "a": "b"
       },
       {
    -    "c": "d"
    -  },
    -  {
         "e": "f"
       }
     ]

josephburnett · 2024-09-09T20:32:41 1725913961

Yeah that works. But I also wanted the ability to produce JSON Patch and JSON Merge Patch formats. And to support set semantics, identifying objects by specified keys. And it works on YAML too.

deepakarora3 · 2024-09-10T17:40:32 1725990032

There is a diff functionality which I have provided in unify-jdocs that I think does exactly what you are looking for. You can get the details here -> https://github.com/americanexpress/unify-jdocs. At present it is only for Java. And if you do take a look, please feel free to give feedback - thanks.

bugglebeetle · 2024-09-09T13:52:28 1725889948

IIRC DeepDiff does something like this:

https://github.com/seperman/deepdiff

Karupan · 2024-09-09T01:40:21 1725846021

Looks neat! Definitely more reliable than my hacky jq script[0] which I had to write for envs with only sh and jq

[0] https://gist.github.com/Checksum/17c84306f563eca40b353f6ed83...

newman314 · 2024-09-09T00:48:21 1725842901

What I've done in the past is pipe JSON into gron then diff. Works sufficiently well for eyeballing.

planetpluta · 2024-09-09T00:24:53 1725841493

This tool looks great! I’ve been using difftastic lately, which does a fairly good job but struggles with big json files.

One feature I’ve yet to see is applying jq query syntax to the jsons before the diff

josephburnett · 2024-09-09T17:27:51 1725902871

> One feature I’ve yet to see is applying jq query syntax to the jsons before the diff

Will you please add this as a feature request? https://github.com/josephburnett/jd/issues. I would like to hear more about how you would use it.

ramnes · 2024-09-10T07:01:39 1725951699

Have been using it in a Go project lately, wonderful library! Ended up with jd after trying a few others that couldn't handle edge cases, such as creating a diff between `[]` and `{}`. Love the diff format as well.

iwwr · 2024-09-09T09:40:33 1725874833

Is this useful for large json files, on the order of GiB?

dgelks · 2024-09-09T07:47:30 1725868050

Was just using it to compare two massive json files, super performant and useful compared to using jq

agumonkey · 2024-09-09T12:07:53 1725883673

We're not that far from jsolog.

mentalgear · 2024-09-09T08:10:17 1725869417

very very nice, just the tool I needed for the current task - and here it is! :)

surfingdino · 2024-09-09T09:41:47 1725874907

Use it daily to make JSON payloads more readable. One of Open Source true gems.

swah · 2024-09-09T11:03:58 1725879838

I use jq for that!

gvv · 2024-09-09T09:28:06 1725874086

nice, super useful for debugging API responses. Would be nice to be able to use it as a VSCode extension!

josephburnett · 2024-09-09T17:30:04 1725903004

> Would be nice to be able to use it as a VSCode extension!

I've added support to use jd as a Git diff engine: https://github.com/josephburnett/jd?tab=readme-ov-file#use-g.... Can you configure VS Code use a custom command to show diffs?

g_dhoot · 2024-09-09T01:51:02 1725846662

This looks neat and useful!