I suppose the issue could be alleviated by having the tunnel know its payload is IP packets, keep a reasonably short buffer, and drop payload packets from the queue after a short timeout, e.g. 200ms. So essentially, the tunnel needs to behave like a switch would.
Edit: Obviously, it still reacts twice as badly to the base connection dropping packets. I recall once testing how a TCP connection behaves when you subject it to a random % of packet loss (regardless of the bandwidth it tries to use), and it resulted in completely stalled connections at surprisingly low drop rates.
Interesting idea, although I'd argue that if it should ACK like UDP, retransmit like UDP and control flow and congestion like UDP... You should use UDP ;)
However, firewalls are usually much more permissive to TCP than to UDP. I wonder if there is any project that encapsulates UDP-semantic datagrams into TCP-looking segments?
UDP is treated fairly well by firewalls... at least compared to SCTP for example. QUIC/HTTP3 are UDP-based and even though there's usually a TCP/HTTP2 fallback they fare reasonably well.
I have various boxes which we send out to venues, on the whole outgoing connections are fine, but sometimes you get some really restrictive policies. I've had stuff that MITMs TCP/443, completely blocks UDP, etc.
My devices tend to try to connect back via
* UDP port 443, sometimes works
* an SSTP VPN
* SSH to tcp/$highnumber, sometimes they block/MITM port 80, 443, but leave the standard
* DNS
I can't think of a time that one of them didn't get through.
It does make it through many firewalls these days, yes.
But all implementations I know use a much shorter timeout/keepalive period for UDP than they use for TCP because of firewalls/NATs. (I think the RFCs even recommend something like 300 seconds for TCP, but only 30 for UDP as a default?)
This has pretty significant implications for power consumption on mobile devices.
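Back-of-envelope: each keepalive is a radio wakeup, so the shorter UDP interval multiplies wakeups accordingly. The 30s/300s intervals below just echo the figures quoted above and are not taken from any RFC:

```python
# How many NAT-keepalive radio wakeups per day does each interval cost?
# Intervals are illustrative (taken from the rough figures above).
SECONDS_PER_DAY = 24 * 60 * 60

def keepalives_per_day(interval_s: int) -> int:
    return SECONDS_PER_DAY // interval_s

udp_wakeups = keepalives_per_day(30)    # short UDP/NAT mapping timeout
tcp_wakeups = keepalives_per_day(300)   # longer TCP mapping timeout
print(udp_wakeups, tcp_wakeups)         # 2880 vs 288 wakeups per day
```

A tenfold difference in wakeups is exactly the kind of thing mobile OS vendors push back on, which is one reason long-lived push connections tend to ride on TCP.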
What exactly do you mean with "completely stalled connections"?
Do you mean that the sending side is queueing up to-be-sent messages and can't clear the queue because it is busy correcting packet losses all the time, so the queue just grows and never shrinks?
Do you recall at which % of packet loss this behaviour started?
Stalled as in, sender has data to send, receiver is ready to receive, but the transfer makes no progress or proceeds very slowly.
I can't quote a number for the drop percentage, honestly I've forgotten. Discovering the limit was a side effect; we were simply looking to test how a piece of software would behave over a bad connection. I just remember being surprised that everything just stopped when I put in packet loss that wasn't anywhere near 100%.
My since-then-adjusted expectations would put any double digit percentage (yes, starting from 10%) of random packet loss as unusable conditions for TCP.
In my personal experience, BBR with SACK works pretty well even on fairly bad connections. It still slows way down but not as badly as others like Cubic.
That was with BitTorrent uploads of Linux ISOs to Taiwan. (Why do they download so many copies of Ubuntu 14.04 LTS?)
But since I didn't do controlled tests of multiple congestion control algorithms I could just be seeing things.
Edit2: In case anyone wants to try themselves, here's how: https://www.pico.net/kb/how-can-i-simulate-delayed-and-dropp...
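For those on Linux, the usual tool for this is tc with the netem qdisc. This is a config fragment that needs root, and `eth0` plus the numbers are placeholders to adjust for your own setup:

```shell
# Inject 10% random packet loss and 100 ms delay on an interface.
# (Requires root; "eth0" and the percentages are placeholders.)
tc qdisc add dev eth0 root netem loss 10% delay 100ms

# Inspect the active qdisc, then remove it when you're done testing:
tc qdisc show dev eth0
tc qdisc del dev eth0 root
```

Applying this on the sender's egress interface is enough to reproduce the stalled-connection behaviour described above without any special test harness.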