Comment by juliangmp

3 months ago

I found that a lot of the problems I had been having with mutexes, stem from the fact that traditionally the mutex and the data it protects are separate. Bolting them together, like Rust's Mutex<T> does, solves a lot these problems. It let's you write normal, synchronous code and leave the locking up to the caller, but without making it a nightmare. You can't even access the data without locking the mutex.

This isn't an attack on the (very well written) article though. Just wanted to add my two cents.

56 comments

juliangmp

torginus 3 months ago

Mutexes suffer from a host of problems, and imo are not a very good concurrency primitive - they were designed to turn single-threaded code into multi-threaded. With todays 8+ cores in most systems, usually a single point of contention quickly becomes a problem.

They're liable to deadlocks/livelocks, and sometimes not only with other explicitly Mutex-like things (it might happen some library you use has a lock hidden deep inside).

They're also often backed byOS primitives (with big overheads) with inconsistent behaviors between platforms (spinlocks, waiting etc). We've run into an issue with .NET, that their version of Mutex didn't wake up the blocked thread on Linux as fast as on Windows, meaning we needed about 100x the time to serve a request as the thread was sleeping too long.

There are questions like when to use spinlocks and when to go to wait sleep, which unfortunately the developer has to answer.

Not assigning blame here, just pointing out that threading primitives and behaviors don't translate perfectly between OSes.

Multi-threading is hard, other solutions like queues suffer from issues like backpressure.

That's why I'm skeptical about Rust's fearless concurrency promise - none of these bugs are solved by just figuring out data races - which are a huge issue, but not the only one.

adwn 3 months ago
Your view on mutex performance and overhead is outdated, at least for the major platforms: The Rust standard library mutex only requires 5 bytes, doesn't allocate, and only does a syscall on contention. The mutex implementation in the parking_lot library requires just 1 byte per mutex (and doesn't allocate and only does a syscall on contention). This enables very fine-grained, efficient locking and low contention.
- torginus 3 months ago
  
  These are OS primitives I'm talking about - I haven't checked out the standard library version but the parking_lot version uses a spinlock with thread sleep when the wait times get too high - it has no way of getting notified when the mutex gets unblocked nor does it support priority inversion.
  It seems it's optimized for scenarios with high performance compute heavy code, and short critical sections.
  These assumptions may let it win benchmarks, but don't cover the use cases of all users. To illustrate why this is bad, imagine if you have a Mutex protected resource that becomes available after 10us on average. This locks spins 10 times checking if it has become available )(likely <1us) then yields the thread. The OS (lets assume Linux) wont wake it up the thread until the next scheduler tick, and its under no obligation to do so even then (and has no idea it should). But even best-case, you're left waiting 10ms, which is a typical scheduler tick.
  In contrast OS based solutions are expensive but not that expensive, let's say that add 1us to the wait. Then you would wait 11us for the resource.
  A method call taking 10ms and one taking 15 us is a factor of 60x, which can potentially kill your performance.
  You as the user of the library are implicitly buying into these assumptions which may not hold for your case.
  There's also nothing in Rust that protects you from deadlocks with 100% certainty. You can fuzz them out, and use helpers, but you can do that in any language.
  So you do need to be mindful of how your mutex works, if you want to build a system as good as the one it replaces.
  
  10 replies →
- charleslmunger 3 months ago
  
  Unfortunately the standard library mutex is designed in such a way that condition variables can't use requeue, and so require unnecessary wakeups. I believe parking lot doesn't have this problem.
- ahoka 3 months ago
  
  It's called a futex and supported by both Linux and Windows since ages.
  
  1 reply →
- magicalhippo 3 months ago
  
  How does it avoid cache contention with just a few bytes per mutex? That is, multiple mutex instances sharing a cache line. Say I have a structure with multiple int32 counters protected by their own mutex.
  
  7 replies →
kiitos 3 months ago

this is an overly simplistic and somewhat reductive perspective on a pretty fundamental concept/primitive
> they were designed to turn single-threaded code into multi-threaded
not really
> usually a single point of contention quickly becomes a problem.
not generally, no
> They're liable to deadlocks/livelocks,
deadlocks/livelocks are orthogonal to any specific primitive
> They're also often backed byOS primitives (with big overheads) with inconsistent behaviors between platforms (spinlocks, waiting etc).
the mutex as a primitive is orthogonal to any specific implementation...
etc. etc.

kragen 3 months ago

Traditionally traditionally, monitors were declared together with the data they contained, and the compiler enforced that the data was not accessed outside the monitor. Per Brinch Hansen wrote a rather bitter broadside against Java's concurrency model when it came out.

csb6 3 months ago
Was this the article?
http://brinch-hansen.net/papers/1999b.pdf
- kragen 3 months ago
  
  This is a toned-down, but still scathing, version of what I remember reading.

Nauxuron 3 months ago

> You can't even access the data without locking the mutex.

It's even nicer than that: you can actually access data without locking the mutex, because while you hold a mutable borrow to the mutex, Rust statically guarantees that no one else can acquire locks on the mutex.

https://doc.rust-lang.org/std/sync/struct.Mutex.html#method....

jstimpfle 3 months ago
Given a data item of non-thread safe type (i.e. not Mutex<T> etc), the borrow checker checks that there's only ever one mutable reference to it. This doesn't solve concurrency as it prevents multiple threads from even having the ability to access that data.
Mutex is for where you have that ability, and ensures at runtime that accesses get serialized.
- dwattttt 3 months ago
  
  The maybe unexpected point is that if you know you're the only one who has a reference to a Mutex (i.e. you have a &mut), you don't need to bother lock it; if no one else knows about the Mutex, there's no one else who could lock it. It comes up when you're setting things up and haven't shared the Mutex yet.
  This means no atomic operations or syscalls or what have you.
  
  8 replies →

mgaunard 3 months ago

I find it better to model that as an Actor than a mutex, but I guess it's inherently the same thing, except the actor also allows asynchronous operations.

gpderetta 3 months ago
You can go full circle and also make operations on a mutex asynchronous. Hence the realization that message passing and shared memory are truly dual.
- mgaunard 3 months ago
  
  The very idea of a mutex is that it is synchronous. You wait until you can acquire the mutex.
  If it's asynchronous, it's not a mutex anymore, or it's just used to synchronously setup some other asynchronous mechanism.
  
  10 replies →

indigo945 3 months ago

This doesn't solve the deadlock problem, however.

dist-epoch 3 months ago

Sounds like the Java synchronized class.

masklinn 3 months ago

No. It’s not a property of the type so you can have multiple items under a mutex and you’re not at the mercy of whoever wrote it, it works fine with POD types, it does not force a lock / unlock on each method call (instead the compiler essentially ensures you hold the lock before you can access the data), and the borrow checker is there to ensure you can not leak any sort of sub-states, even though you can call all sorts of helpers which have no requirement to be aware of the locking.
It’s what synchronized classes wish they had been, maybe.
the_gipsy 3 months ago

Not at all. With rust you cannot accidentally leak a reference, and here's the killer: it guarantees these properties at compile time.