Go's Concurrency Examples in Java 19 (mccue.dev)
151 points by emccue on May 3, 2022 | 114 comments


> There is no way for your operating system to know exactly how much stack space a thread will need, so it allocates an amount on the order of a megabyte. You only have around a baker's dozen gigabytes of RAM, so you can only make, give or take, 10,000 threads.

That's not true at all. The megabyte(s) of stack space is virtual, spawning a thread only requires a few kilobytes of memory initially, and is a lot cheaper than many realize.


Here is the part where I admit that I am an idiot and don't actually know.

All I do know is that in Java I cannot create this many threads, and in the talks given about Project Loom this is cited as the explanation for why.

https://www.youtube.com/watch?v=EO9oMiL1fFo @ ~4:00

Is it maybe not the actual spawning but some "switch" that flips once you start using the thread?

I'll update the post or leave a comment with a correction once I understand exactly what you mean.


The explanation in the video is good, here is a transcription for convenience (emphasis added):

> OpenJDK implements threads as thin wrappers around the OS thread [...] OS threads are expensive, by which I mean that we cannot have too many of them. Maybe a few thousand active ones, maybe ten thousand active ones, but that's about it. And the reason for that is that the OS doesn't know the various languages, and the most costly component of a thread is its stack, and the operating system doesn't know how the various languages it might support makes use of the stack. So it must address the worst case and allocate a very large stack. It's usually on a megabyte scale. And even though we use virtual memory and not all of it is committed to physical memory, once we touch a page it becomes committed – it can't be uncommitted again because the OS doesn't know how the stack is used. And in any event, all the operations are done at a page granularity which is 4k, it's also not that small.

There may also be arbitrary OS-specific limits. For example, on Linux there is a system-wide limit on the number of threads, controlled by /proc/sys/kernel/threads-max (docs: https://www.kernel.org/doc/html/latest/admin-guide/sysctl/ke..., see also: https://stackoverflow.com/a/8849114). On my system it's ~60,000 (with 8 GB RAM).


> once we touch a page it becomes committed – it can't be uncommitted again because the OS doesn't know how the stack is used.

I wonder if this could be solved by the code itself. Every so often tell the OS that you don't need all of the stack pages that are currently unused.

Of course getting the "every so often" right to avoid a huge slowdown in edge cases isn't easy.


For Java, I wonder if you could make the stack size smaller to increase the number of threads possible (as long as you know your application can handle the smaller stack, i.e. no deep recursion etc.).


With Fibers you can implement user-space switching between virtual threads. For example, if a thread is going into a top-level select, switch it to a small stack, putting the original large stack into a pool. On wake/resume, switch back to a larger stack. That's basically what Project Loom is doing, I suspect.


No, I understand how Loom limits the amount of stuff on the stack. It's just that I thought threads always had those large stacks.

Based on what the top person said, I'm thinking that they don't get a large stack until they cross some threshold of use, but I want to make sure.


Stacks are allocated statically upfront. However, you can change the size the kernel allocates, you can point them at a custom memory space instead of the default one, and you can change the stack you're running on. Also, like all memory pages, stack space isn't actually allocated until you touch it, so if your stack depth doesn't get crazy large and you're not storing large objects on the stack, it's possible your 16 MB stack is only using 1 MB or so of real RAM.


In Java, it defaults to 1MB on platforms other than Windows.

I've seen people OOM themselves by setting the default thread stack size massively high - they had likely gotten confused with the args (they'd used -Xss when I think they meant -Xms) and given their threads 1GiB of stack each.

I can't speak to how Java manages stack size between itself and the OS though, I'm sure there's deep magic there.


It's also not true when ... you tell it how much you need. Windows lets you specify a stack size [1].

[1] https://docs.microsoft.com/en-us/windows/win32/api/processth...


Fun fact: you can declare a stack size at compile time (something like `cl ... -stack:0x5000000,0x5000000 ...`), which lets you create an insane stack size so that you can fit all your software in the stack. Useful when you don't want to add an allocator when making tiny, tiny executable files :D


The main cost of spawning a thread is simply the time it takes to spawn.


What are you trying to say? This sounds like a tautology to me.


I think they meant that the main cost of a new thread that you'd like to create is the time it takes to spawn (i.e., the time it takes to set up the new thread and get it ready to run) and NOT the memory that gets allocated for it, nor the operating system overhead of keeping it around afterwards, etc, etc.

(I'm not agreeing or disagreeing; I'm only trying to helpfully clarify)


They are saying that most of the cost is startup cost, not running overhead.


This is not a tautology.


But does Loom reclaim allocated stack space once the thread's stack shrinks?


> But does Loom reclaim allocated stack space once the thread's stack shrinks?

Could you elaborate why you asked this? Is it because it's not done already? I'm genuinely curious.


I believe so, yes.


People sometimes respond with some puzzlement to InterruptedException, unsure what they should do. All it means is that some other thread has requested that this thread stop what it's doing. So do what you think your code should do in that situation. E.g. in the case of `say`, the most reasonable thing is to simply return.

You shouldn't normally call `Thread.currentThread().interrupt()` inside a `catch (InterruptedException e)` unless, perhaps, you don't throw anything. That would prevent any blocking operation inside cleanup code from working. As a general rule, a thread's interrupt status is set or an InterruptedException (or some other exception) is thrown, but not both. That is how JDK methods do it too: if they throw, they clear the interrupt status.


Aight.

I only started doing this recently, after this advice here:

https://discord.com/channels/544924237773668373/930457954346...

I'll clear it from the examples.


Just FYI, that link is not publicly accessible.


While there are use cases where lightweight threads are a benefit, I think a web crawler is pretty much the ideal case for async/await, given you are almost exclusively waiting. Although I must admit I don't really understand this example: why is the fetching being parallelized? You typically want to add sleeps; even with a single thread you've got yourself a Gatling gun that can DoS a website if you aren't careful.

There's also some optimizations that could be made. Instead of having

    synchronized (seen) {
        if (!seen.contains(u)) {
            seen.add(u);
            // ...
        }
    }

you can get rid of the synchronization with

    // seen = ConcurrentHashMap.newKeySet()
    
    if (seen.add(u)) {
        // ...
    }


I was surprised by the lack of ConcurrentHashMap. Does Go not have an equivalent?


Go does have a concurrent map in the standard library, but the documentation recommends not using it unless you have specific access patterns that it's optimized for:

https://pkg.go.dev/sync#Map

You lose type safety when using it. I'm guessing with generics in 1.18 now, we'll see an updated generic version of sync.Map whenever a bunch of new collection types are added to the standard library that take advantage of generics.
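
For reference, a minimal sketch of the untyped sync.Map API as it stands; the `any`-typed returns are exactly where the loss of type safety shows up:

    package main

    import (
        "fmt"
        "sync"
    )

    func main() {
        var m sync.Map // the zero value is ready to use

        m.Store("hello", 1)
        if v, ok := m.Load("hello"); ok {
            // Values come back untyped, so a type assertion is needed.
            fmt.Println(v.(int) + 1)
        }

        // LoadOrStore is an atomic "insert if absent", no external lock needed.
        actual, loaded := m.LoadOrStore("world", 2)
        fmt.Println(actual, loaded) // 2 false
    }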


Yeah, I'm looking forward to Go's stdlib evolving useful collections now that generics exist.

Like maybe a set: one that isn't a map[T]interface{} and can do things like unions and intersections, so I don't have to implement them with for loops.


> That isn't a map[T]interface{}

FWIW, while map[T]interface{} is somewhat more convenient to use, for efficiency reasons you should prefer map[T]struct{}: interface{} is a fat pointer, so it takes 16 bytes times the capacity of the map/set, while struct{} is a zero-size type and takes 0.

It also signals much more clearly that it's a map to nothing, which is a set; a map[T]interface{} could be an actual map to a bunch of random things.
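
As a sketch of how that pattern might be wrapped up in a generic type (a hypothetical Set, not a stdlib API), including the union operation the parent comment wishes for:

    package main

    import "fmt"

    // Set is a hypothetical generic set backed by zero-size struct{} values.
    type Set[T comparable] map[T]struct{}

    func (s Set[T]) Add(v T)           { s[v] = struct{}{} }
    func (s Set[T]) Contains(v T) bool { _, ok := s[v]; return ok }

    // Union returns a new set holding the elements of both sets.
    func (s Set[T]) Union(other Set[T]) Set[T] {
        out := Set[T]{}
        for v := range s {
            out.Add(v)
        }
        for v := range other {
            out.Add(v)
        }
        return out
    }

    func main() {
        a := Set[int]{}
        a.Add(1)
        a.Add(2)
        b := Set[int]{}
        b.Add(2)
        b.Add(3)
        fmt.Println(len(a.Union(b)), a.Contains(1)) // 3 true
    }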


While you're not wrong, and that is how you implement a set in Go, it's still a workaround over a standard library Set type unfortunately.

But, with Go's generics, I hope they will add new collection types to the standard library.


It's actually much more common to use map[T]bool, because it allows easy membership testing just with

    if m[x] { ... }


Not that I have seen. map[T]struct{} is more common, with if _, ok := m[x]; ok { ... } used for testing if a key is in the map. The space savings really add up with large sets.


At least at Google, map[T]bool is more common, by an order of magnitude. Probably because map[T]bool is recommended by our official style guide, which tends to favor readability over a negligible performance difference.


That is largely a distinction without a difference; the main gist of my comment was that using an interface{} value, while convenient in some ways, is way costlier than necessary (at least in memory) without any real advantage.


This is exactly why a generic Set collection should have been there since the start.


I don't disagree at all, I'm just pointing out a less than ideal pattern.


And I much appreciate your feedback :) Whenever I try to recall the best-practice idiom, I always mix up map[T]struct{} and map[T]interface{}, due to the curly brackets at the end.

As you can tell, it's been a while since I've written Go.


Hah, 50/50 I'd get that wrong, cheers.


In Go, while it is possible to use a concurrent hash map, it's generally not recommended. In most cases, you should use a channel.

In one of my first Go programs, I used mutexes to avoid concurrent writes in a critical part. Then I moved the writing process into a goroutine that receives its input through a channel of size 1. In my case, the latter code was easier to maintain and more performant than the former, mutex-based code.
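
A minimal sketch of that pattern, with a hypothetical goroutine that owns the output; because every write goes through the channel, no mutex is needed:

    package main

    import (
        "fmt"
        "sync"
    )

    func main() {
        lines := make(chan string, 1) // channel of size 1, as described above
        var wg sync.WaitGroup
        wg.Add(1)

        // A single goroutine owns the shared resource (stdout here);
        // the channel serializes all writes to it.
        go func() {
            defer wg.Done()
            for line := range lines {
                fmt.Println(line)
            }
        }()

        for i := 0; i < 3; i++ {
            lines <- fmt.Sprintf("write %d", i)
        }
        close(lines)
        wg.Wait()
    }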


Yeah - ConcurrentHashMap makes the most sense. I left it off just because I thought it would be easier to see the parallel with the Go code.


In Go, the select branch to take is non-deterministically chosen from the set of enabled branches. For the examples I don't think it makes a difference, but in general you cannot assume that a select attempts to choose branches from top to bottom.
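
A quick sketch that makes this visible; with both channels ready on every iteration, the counts come out roughly 50/50 rather than always favoring the first branch:

    package main

    import "fmt"

    func main() {
        a := make(chan int, 1)
        b := make(chan int, 1)
        counts := map[string]int{}
        for i := 0; i < 1000; i++ {
            a <- 1
            b <- 1
            // Both cases are ready, so the branch is picked pseudo-randomly.
            select {
            case <-a:
                counts["a"]++
                <-b // drain the other channel before the next round
            case <-b:
                counts["b"]++
                <-a
            }
        }
        fmt.Println(counts) // roughly 50/50 between the two branches
    }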


I wonder if that's on purpose, to find bugs earlier. I know that the implementation of map in Go will intentionally shuffle the keys when you try to iterate over them, so that you never accidentally rely on insertion order.
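
That one is easy to observe; run this a few times and the printed key order changes between runs:

    package main

    import "fmt"

    func main() {
        m := map[string]int{"a": 1, "b": 2, "c": 3}
        // Map iteration order is deliberately randomized by the runtime,
        // so this can print the keys in a different order on every run.
        for k, v := range m {
            fmt.Println(k, v)
        }
    }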


It's very much on purpose, just like default maps being unordered. People should not rely on the ordering as it breaks the semantics of these features according to Go. It fits the Go philosophy well.


It's non-deterministic on purpose because that's how it's required to be to fit the model of CSP (communicating sequential processes).


I've read somewhere (possibly when Go came out) that it's on purpose. For fun, Alan Cox once made a post on Google+ (RIP) showing a crappy random number generator based on Go's select.


Loom's virtual threads are much more similar to GHC's threads than to goroutines. And GHC Haskell is also able to spawn millions of threads, trivially.


Can you elaborate more? What is the distinguishing factor?


> In Go there is less noise, but also there no way to interrupt Go's time.Sleep.

You can do so in one of two ways using the context API (https://go.dev/blog/context) while keeping the code synchronous and idiomatic:

    package main

    import (
        "context"
        "fmt"
        "time"
    )

    func say(ctx context.Context, s string) {
        for i := 0; i < 5; i++ {
            select {
            case <-time.After(100 * time.Millisecond):
                fmt.Println(s)
            case <-ctx.Done():
                return // Operation canceled or timed out.
            }
        }
    }

    func main() {
        ctx := context.Background() // Use context cancellation semantics.
        go say(ctx, "world")
        say(ctx, "hello")
    }
If the code is reasonably performance sensitive AND there's a high degree of likelihood the context is canceled before the operation completes, you can use a time.Timer similar to what follows:

    for i := 0; i < 5; i++ {
        func(i int) { // Wrap in closure expressly to support defer statement.
            t := time.NewTimer(100 * time.Millisecond)
            defer t.Stop()
            select {
            case <-t.C:
                fmt.Println(s)
            case <-ctx.Done():
                return // Operation canceled or timed out.
            }
        }(i)
    }


Judging by the examples, the Go code is easier to read, write and maintain.


The examples translate Go idioms directly to Java code rather than using Java idioms; you wouldn't normally write Java like Go any more than you'd write Go like Java.

They also make use of constructs that are overkill for such simple examples. For example, the first example could be written as:

   Thread.startVirtualThread(() -> say("world"));
   say("hello");
which would also replicate the same concurrency bug as in the original Go program (the Java example, as written, does not have that bug, but neither would

   var t = Thread.startVirtualThread(() -> say("world"));
   say("hello");
   t.join();
).

Go makes certain concurrency bugs easy to introduce and hard to find, while Java helps avoid them, but that is also not easy to see from these particular examples.


Is there an idiom in Loom for cancellation propagation and deadlines? I can't find anything in https://download.java.net/java/early_access/loom/docs/api/ja... about it. Do we need to run a second virtual thread that waits on the cancellation condition and interrupts, and then handle that exception gracefully in the interrupted virtual thread?



Thanks!


After mulling over Elixir vs Go to have a tool in the toolbelt covering the use case of concurrent server-side apps, I picked a book on Go and went through the first chapter.

Disenchanted to hear that (some) concurrency bugs are easy to introduce and hard to find in Go. Could someone say how this compares to Elixir?


It all depends on what you're trying to do.

These cases work flawlessly in Go, in my experience:

* Batch 10 tasks and run them concurrently (using WaitGroups; see the sketch after this list).

* Locking resources while reading and writing with a mutex like sync.RWMutex, using .Lock() and .Unlock(), often together with defer.

* Locking resources while only reading with a sync.RWMutex, using .RLock() and .RUnlock(), often together with defer.

* Running code "in the background" with goroutines, i.e. go func() { <code goes here> }().

* Detecting race conditions by building with go build -race and then running the program.
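
For the first bullet, a minimal sketch of the batch-and-wait pattern:

    package main

    import (
        "fmt"
        "sync"
    )

    func main() {
        var wg sync.WaitGroup
        for i := 0; i < 10; i++ {
            wg.Add(1)
            go func(task int) {
                defer wg.Done() // pairing Done with defer, as noted above
                fmt.Println("running task", task)
            }(i)
        }
        wg.Wait() // block until all 10 tasks have finished
    }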

However, when using channels, select, closing channels, waiting on channels in a for loop, etc., things can easily become more subtle.

On a general basis, I disagree that concurrency bugs are easy to introduce and hard to find in Go. However, pure functional programming languages like Haskell, with little state, will always have the upper hand when it comes to concurrency.

See also:

https://gobyexample.com/waitgroups

https://gobyexample.com/mutexes

https://gobyexample.com/defer

https://gobyexample.com/goroutines

https://gobyexample.com/non-blocking-channel-operations


Unfortunately Go's defer statement is function-scoped, so it is very easy to not unlock a mutex at the intended place.
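
A sketch of the gotcha and the usual workaround; since defer fires at function return rather than at the end of the loop body, the first version deadlocks on its second iteration (sync.Mutex is not reentrant):

    package main

    import (
        "fmt"
        "sync"
    )

    var mu sync.Mutex

    // Buggy: the deferred Unlock runs at function return, not per iteration,
    // so the second mu.Lock() deadlocks against the still-held first lock.
    func processAll(items []int) {
        for range items {
            mu.Lock()
            defer mu.Unlock()
            // ... touch shared state ...
        }
    }

    // The usual fix: give the critical section its own function scope.
    func processAllFixed(items []int) {
        for i := range items {
            func() {
                mu.Lock()
                defer mu.Unlock() // now runs at the end of this closure
                fmt.Println("handled item", i)
            }()
        }
    }

    func main() {
        processAllFixed([]int{1, 2, 3})
    }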


Yeah, sorry.

I would have used .startVirtualThread more if I wasn't under the impression that I needed to do Thread.currentThread().interrupt() when catching and re-throwing as a RuntimeException.

So I was choosing between the noise of that in the examples and the noise of the executor + submit with a lambda that returned null.


That's more to do with your preferences.

For me Golang has far weaker and less flexible error handling semantics.

And concurrency is more verbose in Java but it's easy enough to abstract it behind a wrapper.

Whereas you can't escape Golang's error handling.


Of course, you have to consider whether you SHOULD escape it.

I do believe you probably have more control over Java's threads, making them a bit more verbose. But, both Java itself - as you mention - and languages built on top of the JVM have ways to make it less verbose.


Well, that's because goroutines and channels are first class in Go and have special syntax.

Java, in contrast, tries to mostly keep its existing syntax.

I guess people who write Java are used to a lot of "lasagna" all-is-object boilerplate and ignore it, the same way Go people are used to ignoring the `err != nil` lines everywhere.


Well, as someone who has been developing software in Java for 20 years and never touched Go: it's the other way around.


I developed and maintained Java software professionally for over 20 years, since 1.0, but now I work in Go. There is no way I’d willingly go back to Java, and these examples really underscore some of the reasons.

I do however wonder, now, what I’m missing out on with other languages, particularly swift and rust.


Swift has great support for enums (and ADTs in general). That's about the only thing I miss when I switch from Swift to Go. Generics and protocol-oriented programming are nice, but the ergonomics still make it not that enjoyable (you start banging your head against walls pretty quickly beyond the most simple examples).


Rust has a great, Haskell-inspired way to create algebraic data types (with the enum keyword). These same structures can be very nicely destructured and pattern matched on.


Can you elaborate? I mean there may be some new operators and concepts in the Go versions, but they're explained succinctly in https://go.dev/tour/


Java is a similarly small language, with perhaps even less of a learning curve.


Judging by the examples, when it comes to code that deals with concurrency both Go and Java are difficult to maintain. Plenty of times I have dealt with "clever code" (written by others and by my past self) that works 99% of the time.


The Go solution to the TreeWalk problem is wrong, or rather, not real-world, because it has a memory leak. You need to make sure all goroutines finish and don't get stuck reading from or writing to a channel.

In Kotlin coroutines you can .cancel() a coroutine, but Go doesn't let you cancel goroutines.
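
To be precise, cancellation in Go has to be cooperative: a goroutine only stops if it checks for a signal. A minimal sketch with a done channel (context.Context works the same way):

    package main

    import (
        "fmt"
        "time"
    )

    // worker exits when done is closed; without that case it would block
    // forever on out once the reader goes away, which is the leak above.
    func worker(out chan<- int, done <-chan struct{}) {
        for i := 0; ; i++ {
            select {
            case out <- i:
            case <-done:
                fmt.Println("worker: canceled, exiting")
                return
            }
        }
    }

    func main() {
        out := make(chan int)
        done := make(chan struct{})
        go worker(out, done)

        fmt.Println(<-out, <-out) // consume a couple of values
        close(done)               // signal cancellation
        time.Sleep(10 * time.Millisecond)
    }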


time.Sleep(100 * time.Millisecond) could also be: <-time.After(100 * time.Millisecond) so that an imaginary timeout can be easily handled with a select.


Be careful with time.After: if you select on time.After and another channel c, and c breaks the select, then the timer will not be deallocated until it expires. So if the select is in a loop you can easily leak memory. Another of Go's hidden gotchas.
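
A minimal sketch of the leaky shape; c here is just a hypothetical busy channel, and each iteration allocates a fresh timer that stays live for its full minute even though c won the select:

    package main

    import (
        "fmt"
        "time"
    )

    func main() {
        c := make(chan int)
        go func() {
            for i := 0; ; i++ {
                c <- i
            }
        }()

        for i := 0; i < 5; i++ {
            select {
            case v := <-c:
                fmt.Println("got", v)
                // The timer started by time.After below lingers until the
                // full minute elapses; in a hot loop these pile up, which
                // is the leak described above.
            case <-time.After(time.Minute):
                fmt.Println("timeout")
            }
        }
    }

The fix, as shown elsewhere in the thread, is time.NewTimer plus a deferred Stop.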


True, but I didn't write any of the Go here. Somewhat out of my wheelhouse.


Good read! I just saw that in one example you create and start a thread by invoking #startVirtualThread, and then you call start on the returned thread again.


Oof, fixed


Interesting article history! Does your site have an RSS feed to keep up with updates in a feed reader? I couldn't find one.


How is this different than Kotlin coroutines? They seem to already support this feature.


First-class support on the JVM, instead of .class boilerplate emulating the feature, usable only from Kotlin.


The example I think is compelling is JDBC.

Nearly all SQL database access goes through it, and it won't be rewritten on any practical time scale. It can't be meaningfully async without a rewrite, and Kotlin's coroutines can't change that.

With virtual threads that synchronous API can be used at the performance level of what a hypothetical JDBC rewrite would give.

(Not that you want to hit your DB that much, but you know what I mean. You don't need to specially wrap or bundle apis to get into an async call juggle.)


There are already some efforts in that direction. You can check out R2DBC and the many Vert.x SQL clients. While they're not really JDBC, they're fully async like you described.


I was investigating this lately, and as you say, they are "efforts", and while laudable, I wouldn't go so far as to call them solutions. There are open issues/caveats as a result of the async model.

I do think there is something great about being able to continue to leverage existing JDBC-related tooling, given that it already exists.


And the importance of this first class support cannot be overstated. It means:

- meaningful complete stacktraces

- debugger support

- all blocking libraries get the benefits of async code automatically [1]

- all monitoring features of the platform work

- profilers get more useful again

- less mental overhead when programming, i.e. simplicity

[1]: With one caveat: Libraries which call directly into blocking native calls do not become async automatically. But they are very rare on the Java platform.


Kotlin's co-routine framework will be able to use Loom-style green threads as soon as they become available. Until then, it uses its own callback-based mechanism to schedule co-routines (technically very similar to using Futures or Promises).

If you are already using co-routines, there's very little you will need to do other than updating the library when they add support for this. Other than that, it indeed already has essentially all of the feature set that Loom adds to Java. So do several other frameworks, like Spring's Flux, Vert.x, RxJava, etc. Kotlin's co-routine framework actually has extension functions for all of these, as well as for things like threadpools and worker threads in Java.

The value of Loom is mainly adding low-level JVM support for this. Other frameworks will evolve to use that as well. The Java API for this is probably not that interesting, even if you do use Java; you probably should be using something a bit higher level. There are quite a few frameworks for Java implementing all sorts of ways to do the same things. All Loom does is add a few useful things for these frameworks to do their thing a bit faster.

Kotlin's co-routine library is actually a multiplatform library that works in natively compiled Kotlin on iOS/Linux/Windows/etc. (and soon wasm), in kotlin-js in a browser or node.js, and on the JVM. On each of these platforms, it uses a platform-specific implementation as well as ways to seamlessly interact with platform-specific things (e.g. promises in a browser; futures, threadpools, Spring Fluxes, etc. on the JVM). So, how co-routines are executed and scheduled is platform specific. The resulting code tends to be very portable and look very similar. I actually maintain a few multiplatform Kotlin things that support co-routines and they work great on both the JVM and in a browser.

When Loom releases, it will just be yet another thing that co-routines can be executed with. On the JVM, you can trivially create a co-routine dispatcher from a thread pool. A co-routine dispatcher is basically where you schedule your suspend functions to run. Loom basically just provides alternate threadpool and thread implementations that follow the same API and use the underlying JVM functionality (which is the really interesting thing it adds). So, the changes for Kotlin's co-routine library to support this are going to be pretty minimal. In fact, a lot of its current extension functions should just work against the Loom API without any changes.

There's nothing in the Loom apis that is particularly interesting if you use co-routines but the low level JVM optimizations are of course interesting from a performance point of view. Also interesting is the improved ease of dealing with stack traces; which can be a bit of a pain point with co-routines on the jvm. So, lots of good reasons for Kotlin users to look forward to this but no big reason to wait for it. You can use co-routines right now and the same code will continue to work great when Loom gets there in an LTS version in a few years. But it might run a little faster and be a bit easier to debug.


> You probably should be using something a bit higher level.

While there's nothing wrong with using something higher level, the main reason for using higher-level libraries is that using threads doesn't scale past a certain point.

The problem with libraries that try to work around the scalability issue is one of compatibility. Using co-routines is great, but you need workarounds when targeting JDBC or an HTTP client. You have to be using coroutines, or coroutine-friendly code, everywhere, or you risk reducing performance.

For this reason, we just use regular threads and threadpools in our projects. Once Loom lands we'll change the executors to spawn virtual threads and then we'll be done.

That said, coroutines shine if you can't wait for Loom to land or if you're targeting something other than the JVM (like Android).


I think you have a wrong concept of what Kotlin co-routines are and are assuming limitations that they simply do not have. Anything you can do with threads, executors, etc., you can trivially use with co-routines. Better still, it mostly uses those things directly and there is no performance overhead.

I've used co-routines with:

- spring flux

- hibernate & jdbc (not a problem, you just use your connection pool as usual and use a threadpool-based coroutine context to ensure you don't block your asynchronous request-handling main thread. You might prefer switching to more modern alternatives that support asynchronous IO, though; but that stuff is still relatively new and many people stick with blocking IO for this for now. Not in any way a showstopper for using co-routines.)

- apache httpclient with synchronous IO and a connection pool (same thing basically).

- apache httpclient with asynchronous IO (very easy to integrate its callbacks with suspendCoroutine { ... })

- redis (both synchronous and asynchronous)

- mongodb

- Java stream API (there are loads of Java libraries that use that)

I get it, some people don't want to switch language. So, stick with Java. But do it for the right reasons. Performance really isn't one. Anything Java does, Kotlin can do too, and generally about as fast, by virtue of simply using the exact same things in the standard library. And generally without the boilerplate and with a lot of syntactic sugar (i.e. it's nicer). If the Kotlin version of something is slower than the Java version, it's probably because you are doing it wrong. I can't really think of a lot of cases where that would not be easy to fix.

I've seen my fair share of buggy Thread based Java logic over the years; trust me co-routines are way nicer than that. You can still get into trouble with them of course (and mostly for the same reasons). But it covers all the bases pretty well and generally does a better job of keeping you out of trouble.


> I think you have a wrong concept of what Kotlin co-routines are and are assuming limitations that they simply do not have.

What I’m referring to is this:

> you just use your connection pool as usual and use a threadpool based coroutine context to ensure you don't block your asynchronous request handling main thread.

For everything that is not coroutine compatible out of the box, you need to do something like what is described above.

When porting an existing application to coroutines, you also have to identify the places where you need to introduce thread-pool-backed contexts and use them appropriately. You also need to set up these thread pools correctly so that you don't end up using more threads than if you didn't target coroutines.

For us, this extra complexity wasn’t worth it. If we were making an app from scratch and could choose coroutine friendly solutions for everything from the start, that would be different, but porting a legacy app to coroutines hasn’t been a good experience.

Fortunately, Project Loom means all our current thread-based code will be just as scalable as coroutines in the not-too-distant future. That also means we can spawn more (virtual) threads and see a nice speed increase as well.

Btw, we’re already using kotlin for most code, just not coroutines.


OK, I get you don't want to do the work to migrate an existing Java application. It's a lot of work. If you don't have a lot of problems with it right now, you could just leave the code as is.

But just so you know, what you are describing is code that is currently boilerplate-heavy in Java, and Kotlin essentially makes it a lot simpler and easier to reason about. I think you are confusing problem and solution here.

In any case, this is how you create a CoroutineContext backed by a thread pool:

  val myContext = newFixedThreadPoolContext(100,"db-context")
That's it. There's nothing else to do but "use it". There's no need to call shutDown, or have some thingy to intercept uncaught exceptions, or have a lot of boilerplate around using it. A simple launch(myContext) { ... } does the job.


Most of our thread pools are long-lived, so we don't ever shut them down. When the pool is not long-lived, we have a try-with-resources wrapper which handles the shutdown.

Most of the time, we make calls to .execute and then join the different threads at the end. The code running within the threads represents the majority of our code, so the boilerplate you're talking about might be 10 out of 2000 lines.

Our only problem is the number of threads we can spawn safely, and that limit is removed with loom.


The interesting aspect is not "running Kotlin coroutines on Loom", but eliminating the need for source-level coroutine keywords and solving the problem of function coloring, no?


Sorry, but that's not what Loom is about at all. It's mostly about the JVM internals; that's the main chunk of work. The API-level stuff is just designed to be consistent with the existing Thread API, with all its technical debt, flaws, boilerplate, etc. that have inspired alternative frameworks and approaches. "It's consistent with that" is the best you can say about it. To me it looks like a nod to other frameworks, making it as easy as possible for them to integrate this stuff. Good approach, very valid. It's not designed to be more usable or less boilerplate-heavy. That's what other frameworks are for. Loom is not a replacement for these frameworks but something designed to be seamlessly integrated by them.

There's only 1 co-routine related keyword in Kotlin: 'suspend'. The rest of co-routines ships in the form of a normal kotlin multiplatform library that you just add to your project. Java has several keywords that you don't need in Kotlin (e.g. synchronized, volatile). You can mark things as synchronized with a simple kotlin function. And likewise there's a Volatile annotation. Mostly that's just useful for kotlin code that needs to be used from Java.

The function coloring bit seems to obsess a lot of people. I just don't see the issue with it. For me the boiler plate that Java makes you use instead is a much bigger issue.


Sounds like a nice strategy by Jetbrains. Thanks for sharing.


Thanks for the detailed explanation.


Does Go even make sense after Java's Project Loom and GraalVM?


Did you see the examples? Even with the blocking threaded style, the Java versions gave me nightmares. And they didn't even bring up a cancellation example (not 100% sure on that). Cancellation and subsequent graceful shutdown is one of the hardest concurrency problems, one that (a) you are very likely to need in practice and (b) is conveniently often left out of example code. Heck, it isn't even great in Go (too explicit imo), but at least they solved it.


Can you elaborate more?

All of the parts that throw `InterruptedException` are handling the thread interrupt cancelation mechanism.

There is also the example with the atomic boolean quit flag.

I'm not convinced that Go has any unique mechanisms here. Just like Java you can't forcefully kill a thread without that thread's cooperation.


Go provides a context package which allows in-flight cancellation of heavy operations spanning processes and even machines. https://pkg.go.dev/context


This is also part of how you could interrupt `time.Sleep`.

> In Go there is less noise, but also there no way to interrupt Go's time.Sleep.

The full piece would be something like:

    func sleepCtx(ctx context.Context, delay time.Duration) {
        select {
        case <-ctx.Done():
        case <-time.After(delay):
        }
    }

    func main() {
        fmt.Printf("%v\n", time.Now())
        ctx, cancel := context.WithTimeout(context.Background(), 500*time.Millisecond)
        defer cancel()
        sleepCtx(ctx, 1*time.Second)
        fmt.Printf("%v\n", time.Now())
    }
Runnable example on Go playground: https://go.dev/play/p/S5TY3CRmsYO


Yep, this is a utility function I'm sure thousands have written, including myself. It's not in the standard library, which is weird, but Go has been a bit inconsistent since context was a big API change.

Fun fact: that func has a hidden bug, if one should be pedantic. Can you spot it?


> Fun fact: that func has a hidden bug, if one should be pedantic. Can you spot it?

Are you referring to the fact that the timer is still hanging around? Would this be the most-correct version?

    func sleepCtx(ctx context.Context, delay time.Duration) {
        t := time.NewTimer(delay)
        defer func() {
            if !t.Stop() {
                <-t.C
            }
        }()
        select {
        case <-ctx.Done():
        case <-t.C:
        }
    }


[Weird, I remember already replying to this comment.]

Yes that's what I was thinking of :)

I'm a bit unsure what the value is in draining the channel (which is internal to this func), given that Go should garbage collect the channel (and in this case perhaps even apply some escape analysis).

EDIT: Ah they aren't just closing the channel, but sending a time.Time. That makes sense, the timer wouldn't have anyone to send to if it's unbuffered.


Okay this is cool.

I think mechanically the plans for ScopeLocals can be a part of implementing a scheme like that. Either way - I need to read this thoroughly in the morning.


I deploy a single golang binary to well over 15k servers (many are only 100mbit) across many data centers and update it many times a day/week. Using xz, it compresses down to about 5mb.

As much as I love Java (I co-founded Apache Java), golang is a great fit for this usecase.


Okay, I'll bite. So why not distribute a most likely even smaller jar file?


The majority of these machines are PXE booted, with a super minimal ubuntu based OS distribution... which would mean distributing a JVM as well.

Total boot payload is about 160megs... which I need to do some more work on... I think I can get this down to about 120megs, but it hasn't been a priority yet since this is working well enough for now. About 54megs of that boot is just some third party drivers and right now, those are .gz encoded... need to switch to xz to bring it down.


Not the commenter:

I presume it would have to be because there is no guarantee that the JVM will be installed and configured exactly correctly on all of those machines a priori.

Jlink helps solve that problem if you have a properly modularized project, but that's hard to do if your dependencies aren't also properly modularized.


Isn't that the case with libc and the like even in the case of Go? You can't just expect any program to run correctly in an environment you don't control.


Said servers would also need Java installed and updated.

The Go version needs that executable, nothing else.


Doesn't GraalVM as mentioned in the root comment solve that problem? IIRC Graal can compile a JVM + Java application to a single binary, but I could be mistaken.


That's the issue with the size of the binaries produced... you're effectively bundling a JVM.

In the same way that golang is bundling a GC implementation... except it is orders of magnitude smaller than a JVM.


I find it hard to be convinced to use Go at all. If I wanted efficiency, I'd bite and go with something close to the metal. If I wanted something that can build huge applications and has a rich framework ecosystem, I'd go with Java. Go seems like a middle solution for some exceptional cases, which I haven't dealt with.

It is a fresh language and I quite enjoyed exploring it, but I just can't see the use of it. (Also, the multiple return value functions are great, if only there were a way to use a particular value directly from the function without having to assign them to variables.)


GraalVM takes some special effort to get things building natively and the binaries are far larger, so I'm going to go with "yeah".

And I'm hardly a fan of Go.


Why not? Go binaries don't require a separate VM/runtime. They're faster in some cases, and Go has overall simpler-to-read code than Java.

There is room enough for both languages.


The modern Java packaging also includes the runtime itself; there is not even a JRE anymore. And readability is somewhat subjective: Java is a really simple language, and all the writer does is make method calls into various existing libraries. Since those haven't had a chance to evolve yet with the upcoming virtual threads, they are a bit longer at times.

So all that's left is opinionated in-language support vs library support, and it is not clear-cut which is the winner. Java seems to allow finer control over the primitives of concurrency, so a library or another JVM language could build a much better abstraction over the mechanism.


I think the differences would drop off significantly if one were to write Java while keeping the Go guidelines and proverbs in mind; that is, reduce or eliminate the amount of 3rd party libraries (you don't actually need a dependency injection / wiring framework for most applications, just instantiate your objects with dependencies by hand in your main method), reduce the amount of abstraction layers, don't try to be clever, stick with regular loops instead of Streams, etc etc.


I agree, though do keep in mind that a complex framework like Spring doesn’t play in the same league as a typical go microservice in terms of what kind of application is getting written in them.

But I really hope that the new, record-based, minimal boilerplate java style gets hyped up more and more, we really should prefer compile time metaprogramming instead of reflection-based solutions. (E.g. mapstruct is really cool!)


Golang still has an advantage over Java + Loom + GraalVM: the JVM has a better GC, but Golang's heavier use of inline/value types puts less pressure on it and reduces memory usage. So you also need to wait for Java's Project Valhalla to match Golang's profile.


Hypothetically there is a version of "static Java" that could start to encroach upon Go's domain. This may or may not come out of Project Leyden, but I think for the reason the other commenter gave GraalVM isn't exactly it.

Still closer though, and GraalVM native images are pretty cool.

https://mail.openjdk.java.net/pipermail/discuss/2020-April/0...


Only when one is forced to deal with Docker or k8s ecosystem stuff written in Go.

Or as a type safe C for userspace.



