Comment by wahern
1 day ago
I find it easier to understand in terms of the Unix syscall API. `2>&1` literally translates as `dup2(1, 2)`, and indeed that's exactly how it works. In the classic unix shells that's all that happens; in more modern shells there may be some additional internal bookkeeping to remember state. Understanding it as dup2 means it's easier to understand how successive redirections work, though you also have to know that redirection operators are executed left-to-right, and traditionally each operator was executed immediately as it was parsed, left-to-right. The pipe operator works similarly, though it's a combination of fork and dup'ing, with the command being forked off from the shell as a child before processing the remainder of the line.
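That left-to-right, dup2-at-a-time behavior is exactly what makes the classic ordering gotcha make sense. A small sketch (bash; file paths and the helper function are made up for illustration):

```shell
#!/usr/bin/env bash
# A command that writes one line to each stream.
noisy() { echo out; echo err >&2; }

# Each redirection acts like a dup2(), applied left to right:
noisy > /tmp/both.log 2>&1   # fd 1 -> file, then fd 2 -> copy of fd 1: both lines land in the file
noisy 2>&1 > /tmp/one.log    # fd 2 -> copy of the *original* fd 1 (the terminal),
                             # then fd 1 -> file: only "out" lands in the file
```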
Though, understanding it this way makes the direction of the angled bracket a little odd; at least for me it's more natural to understand dup2(2, 1) as 2<1, as in make fd 2 a duplicate of fd 1, but in terms of abstract I/O semantics that would be misleading.
Another fun consequence of this is that you can initialize otherwise-unset file descriptors this way:
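For example (script name and messages invented for illustration), you can run a script with fd 3 pre-opened to the terminal while everything else is silenced:

```shell
# chatty.sh (hypothetical): noisy on stdout/stderr, important lines on fd 3
cat > /tmp/chatty.sh <<'EOF'
echo "debug spam"
echo "a warning" >&2
echo "IMPORTANT: still visible" >&3
EOF

# 3>&1 runs first, so fd 3 becomes a copy of the terminal's stdout
# *before* stdout and stderr are pointed at /dev/null:
bash /tmp/chatty.sh 3>&1 >/dev/null 2>&1
```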
It's a trick you can use if you've got a super chatty script or set of scripts whose output you want to silence or slurp up, but you still want to allow some mechanism for printing directly to the terminal.
The danger is that if you don't open it before running the script, you'll get an error:
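A minimal reproduction of that failure mode:

```shell
# With no redirection for fd 3, the write fails;
# bash reports something like: 3: Bad file descriptor
bash -c 'echo hello >&3'
```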
With exec you can open file descriptors of your current process.
This will fix the error you are describing while keeping the functionality intact.
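A sketch of that fix: `exec` with only redirections applies them to the current shell itself, so fd 3 stays open for everything that follows.

```shell
# Open fd 3 in the current shell as a copy of the current stdout:
exec 3>&1
echo "goes to fd 3" >&3
exec 3>&-   # close it again when done
```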
Now with that exec trick the fun only gets started. Because you can redirect to subshells and subshells inherit their redirection of the parent:
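One way that can look, as a sketch in bash using process substitution (the prefixes, line format, and log path are all invented for the example):

```shell
#!/usr/bin/env bash
# Tag this script's own stdout/stderr and append both to one log file.
exec 1> >(while IFS= read -r l; do printf '%s [%s] INFO  %s\n' "$(date +%T)" "$$" "$l"; done >> /tmp/script.log)
exec 2> >(while IFS= read -r l; do printf '%s [%s] ERROR %s\n' "$(date +%T)" "$$" "$l"; done >> /tmp/script.log)

echo "starting up"          # becomes an INFO line in the log
echo "something broke" >&2  # becomes an ERROR line in the log
```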
And now your bash script will have a nice log with stdout and stderr prefixed with INFO and ERROR and has timestamps with the PID.
Now the disclaimer is that you unfortunately get no guarantee that the relative order of stdout and stderr lines will be preserved, even if everything runs unbuffered (-u and fflush).
Nice! Not really sure of the point, since AI can bang out a much more maintainable (and sync'd) wrapper in Go in about 0.3 seconds
(if runners have sh then they might as well have a real compiler scratch > debian > alpine , "don't debug in prod")
If you just want to print to the terminal even if normal stdout/stderr is disabled, you can also use >/dev/tty, but obviously that is less flexible.
Interesting. Is this just literally “fun”, or do you see real world use cases?
The aws cli has a set of porcelain for s3 access (aws s3) and plumbing commands for lower level access to advanced controls (aws s3api). The plumbing command aws s3api get-object doesn't support stdout natively, so if you need it and want to use it in a pipeline (e.g. pv), you would naively do something like
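The naive attempt might look something like this (bucket and key names are made up; the final positional argument to get-object is the outfile, so you point it at /dev/stdout):

```shell
# naive: ask s3api to write the object to /dev/stdout and pipe it onward
aws s3api get-object --bucket my-bucket --key some/object /dev/stdout | pv > object.bin
```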
Unfortunately, aws s3api already prints the API response to stdout, and error messages to stderr, so if you do the above you'll clobber your pipeline with noise, and using /dev/stderr has the same effect on error.
You can, though, do the following:
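Something like the following (same made-up bucket and key), using a spare descriptor so the object bytes and the API response stop fighting over stdout:

```shell
# 3>&1 runs first, so fd 3 becomes the pipe; then stdout is pointed at
# /dev/null, discarding the JSON API response while the object flows out on fd 3
aws s3api get-object --bucket my-bucket --key some/object /dev/fd/3 3>&1 >/dev/null | pv > object.bin
```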
This will pipe only the object contents to stdout, and the API response to /dev/null.
5 replies →
I have used this in the past when building shell scripts and Makefiles to orchestrate an existing build system:
https://github.com/jez/symbol/blob/master/scaffold/symbol#L1...
The existing build system I did not have control over, and would produce output on stdout/stderr. I wanted my build scripts to be able to only show the output from the build system if building failed (and there might have been multiple build system invocations leading to that failure). I also wanted the second level to be able to log progress messages that were shown to the user immediately on stdout.
It was janky and it's not a project I have a need for anymore, but it was technically a real world use case.
This is often used by shell scripts to wrap another program, so that its input and output can be controlled. E.g. Autoconf uses this to invoke the compiler and also to control nested log output.
One of my use-cases previously has been enforcing ultimate or full trust of a gpg signature.
It was a while ago that I implemented this, but iirc the reason was to validate that the key that signed this is actually trusted, and that the signature isn't just cryptographically valid.
You can also redirect specific file descriptors into other commands:
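In gpg's case, `--status-fd` lets you send its machine-readable status lines (including trust information) to a descriptor of your choosing. A toy stand-in for that pattern, with made-up names, sending fd 3 into another command via process substitution:

```shell
# A command that writes status info on fd 3, separate from its normal output:
emit() { echo "normal output"; echo "STATUS: ok" >&3; }

# fd 3 is redirected into grep; a marker file records whether the status matched
emit 3> >(grep -q '^STATUS: ok' && touch /tmp/status-ok)
```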
Red Hat and other RPM-based distributions' recommended kickstart scripts use tty3 via a similar method
Multiple levels of logging, all of which you want to capture but not all in the same place.
3 replies →
This is probably one of the reasons why many find POSIX shell languages to be unpleasant. There are too many syntactical sugars that abstract too much of the underlying mechanisms away, to the level that we don't get it unless someone explains it. Compare this with Lisps, for example. There may be only one branching construct and one looping construct. Yet, they provide more options than regular programming languages using macros. And this fact is not hidden from us. You know that all of them ultimately expand to the limited number of special forms.
The shell syntactical sugars also have some weird gotchas. The &2>&1 question and its answer are a good example of that. You're just trading one complexity (low level knowledge) for another (the long list of syntax rules). Shell languages break the rule of not letting abstractions get in the way of insight and intuitiveness.
I know that people will argue that shell languages are not programming languages, and that terseness is important for the former. And yet, we still have people complaining about it. This is the programmer ego and the sysadmin ego clashing with each other within the same people. After all, nobody is purely just one of those two.
There must be a law of system design about this, because this happens all the time. Every abstraction creates a class of users who are powerful but fragile.
People who build a system or at least know how it works internally want to simplify their life by building abstractions.
As people come later to use the system with the embedded abstractions, they only know the abstractions but have no idea of the underlying implementations. Those abstractions used to make perfect sense for those with prior knowledge but can also carry subtle bias which makes their use error prone for non initiated users.
> Those abstractions used to make perfect sense for those with prior knowledge but can also carry subtle bias which makes their use error prone for non initiated users.
I don't think 2>&1 ever made any sense.
I think shell language is simply awful.
2 replies →
I like abstractions when they hide complexity I don't need to see nor understand to get my job done. But if abstractions misdirect and confuse me, they are not syntactical sugar to me, but rather poison.
(But I won't claim that I am always able to strike the right balance here)
Seems related to the Law of Leaky Abstractions?
1 reply →
make 2>&1 | tee m.log is in my muscle memory, like adding a & at the end of a command to launch a job, or ctrl+z bg when I forget it, or tar cfz (without the minus so that the order is not important). Without this terseness, people would build myriads of personal aliases.
This redirection relies on foundational concepts (file descriptors, stdin 0, stdout 1, stderr 2) that need to be well understood when using unix. IMO, this helps to build insight and intuitiveness. A pipe is not magic, it is just a simple operation on file descriptors. Complexity exists (buffering, zombies), but not there.
Are you sure you understood the comment you replied to?
I agree that 2>&1 is not complex. But I think I speak for many Bash users when I say that this idiom looks bad, is hard to Google, hard to read and hard to memorize.
4 replies →
Shell is optimized for the minimal number of keystrokes (just like Vim, Amadeus and the Bloomberg Terminal are optimized for the minimum number of keystrokes). Programming languages are primarily optimized for future code readability, with terseness and intuitiveness being second or third (depending on the language).
I'm zero Lisp expert and I don't feel comfortable at all reading this snippet.
This:
> I'm zero Lisp expert
and this:
> I don't feel comfortable at all reading this snippet
are related. The comfort in reading Lisp comes from how few syntactic/semantic rules there are. There's a standard form and a few special forms. Compare that to C - possibly one of the smallest popular languages around. How many syntactical and semantic rules do you need to know to be a half decent C programmer?
If you look at the Lisp code, it has just 2 main features - a tree in the form of nested lists and some operations in prefix notation. It needs some getting used to for regular programmers. But it's said that programming newbies learn Lisps faster than regular programming languages, due to the fewer rules they have to remember.
Yep, there's a strong unifying feel between the Unix api, C, the shell, and also say Perl.
Which is lost when using more modern languages, or languages foreign to Unix.
Python too under the hood, a lot of its core is still from how it started as a quick way to do unixy/C things.
> The pipe operator works similarly, though it's a combination of fork and dup'ing
Any time the shell executes a program it forks, not just for redirections. Redirections will use dup before exec on the child process. Piping will be two forks and obviously the `pipe` syscall, with one process having its stdout dup'd to the input end of the pipe, and the other process having its stdin dup'd to the output end.
Honestly, I find the Bash manual to be excellently written, and it's probably available on your system even without an internet connection. I'd always rather go there than rely on Stack Overflow or an LLM.
https://www.gnu.org/software/bash/manual/bash.html#Redirecti...
And just like dup2 allows you to duplicate into a brand new file descriptor, shells also allow you to specify bigger numbers so you aren’t restricted to 1 and 2. This can be useful for things like communication between different parts of the same shell script.
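One classic use of a spare descriptor is swapping stdout and stderr (helper function invented for the example):

```shell
# fd 3 saves stdout, stdout takes stderr's place,
# stderr takes the saved copy, and fd 3 is closed again:
swap() { echo out; echo err >&2; }
swap 3>&1 1>&2 2>&3 3>&-
```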
I did a google search on “dup2(2, 1)” in a fresh private tab on my iPhone Safari and this thread came up second, between
https://man7.org/linux/man-pages/man2/dup.2.html
and
https://man.archlinux.org/man/dup2.2.en
A lot of bots are reading this. Amazing.
> Though, understanding it this way makes the direction of the angled bracket a little odd; at least for me it's more natural to understand dup2(2, 1) as 2<1, as in make fd 2 a duplicate of fd 1, but in terms of abstract I/O semantics that would be misleading.
Since they're both just `dup2(1, 2)`, `2>&1` and `2<&1` are the same. However, yes, `2<&1` would be misleading because it looks like you're treating stderr like an input.
Haha, I'm even more confused now. I have no idea what dup is...
There are a couple of ways to figure it out.
open a terminal (OSX/Linux) and type:

    man 2 dup

open a browser window and search for:

    man dup2

Both will bring up the man page for the function call.
To get recursive, you can try searching for:

    unix man man
(the unix is important, otherwise it gives you manly men)
> otherwise it gives you manly men
That's only just after midnight [1][2]
[1] - https://www.youtube.com/watch?v=XEjLoHdbVeE
[2] - https://unix.stackexchange.com/questions/405783/why-does-man...
1 reply →
you may also consider gnu info
I find it very intuitive as is
Respectfully, what was the purpose of this comment, really?
And I also disagree, your suggestion is not easier. The & operator is quite intuitive as it is, and conveys the intention.
Perhaps it is intuitive for you based on how you learned it. But their explanation is more intuitive for anyone dealing with low level stuff like POSIX-style embedded programming, low level unix-y C programming, etc, since it ties into what they already know. There is also a limit to how much you can learn about the underlying system and its unseen potential by learning from the abstractions alone.
> Respectfully, what was the purpose of this comment, really?
Judging by its replies alone, not everyone considers it purposeless. And even though I know enough to use shell redirections correctly, I still found that comment insightful. This is why I still prefer human explanations over AI: they often contain information you didn't think you needed. HN is one of the sources of the gradually dwindling supply of such information. That comment is still on-topic. Please don't discourage such habits.