netty5

Author	SHA1	Message	Date
Norman Maurer	86dd388637	Revert "Added UDP multicast (with caveats: no ipv6, getInterface, getNetworkI… (#9006 )" This reverts commit `a3e8c86741` as there are some issues that need to be fixed first.	2019-04-12 21:32:22 +02:00
Norman Maurer	fcfa9eb9a8	Throw IOException (not ChannelException) if netty_epoll_linuxsocket_setTcpMd5Sig fails (#9039 ) Motivation: At the moment we throw a ChannelException if netty_epoll_linuxsocket_setTcpMd5Sig fails. This is inconsistent with other methods which throw a IOException. Modifications: Throw IOException Result: More correct and consistent exception usage in epoll transport	2019-04-12 15:15:27 +02:00
Norman Maurer	778ff2057e	Add IPv6 multicast test to testsuite (#9037 ) Motivation: We currently only cover ipv4 multicast in the testsuite but we should also have tests for ipv6. Modifications: - Add test for ipv6 - Ensure we only try to run multicast test for ipv4 / ipv6 if the loopback interface supports it. Result: Better test coverage	2019-04-12 12:29:08 +02:00
Norman Maurer	45b0daf9e6	netty_epoll_linuxsocket_setTcpMd5Sig should throw ChannelException when not able to init sockaddr (#9029 ) Motivation: When netty_epoll_linuxsocket_setTcpMd5Sig fails to init the sockaddr we should throw an exception and not silently return. Modifications: Throw exception if init of sockaddr fails. Result: Correctly report back error to user.	2019-04-11 18:50:16 +02:00
Steve Buzzard	a3e8c86741	Added UDP multicast (with caveats: no ipv6, getInterface, getNetworkI… (#9006 ) …nterface, block or loopback-mode-disabled operations). Motivation: Provide epoll/native multicast to support high load multicast users (we are using it for a high load telecomm app at my day job). Modification: Added support for (ipv4 only) source specific and any source multicast for epoll transport. Some caveats (beyond no ipv6 support initially - there’s a bit of work to add in join and leave group specifically around SSM, as ipv6 uses different data structures for this): no support for disabling loop back mode, retrieval of interface and block operation, all of which tend to be less frequently used. Result: Provides epoll transport multicast for IPv4 for common use cases. Understand if you’d prefer to hold off until ipv6 is included but not sure when I’ll be able to get to that.	2019-04-08 20:13:39 +02:00
Vladimir Kostyukov	0a0da67f43	Introduce SingleThreadEventLoop.registeredChannels (#8428 ) Motivation: Systems depending on Netty may benefit (telemetry, alternative event loop scheduling algorithms) from knowing the number of channels assigned to each EventLoop. Modification: Expose the number of channels registered in the EventLoop via SingleThreadEventLoop.registeredChannels. Result: Fixes #8276.	2019-03-28 11:33:12 +00:00
Norman Maurer	8206604003	Upgrade to new netty-build and com.puppycrawl.tools 8.18 (#8980 ) Motivation: com.puppycrawl.tools checkstyle < 8.18 was reported to contain a possible security flaw. We should upgrade. Modifications: - Upgrade netty-build and checkstyle. - Fix checkstyle errors Result: Fixes https://github.com/netty/netty/issues/8968.	2019-03-26 14:21:34 +01:00
Lunfu Zhong	e7b3195570	Support ALLOW_HALF_CLOSURE channel option on Unix domain socket. (#8932 ) Motivation: DomainSocketChannel is a DuplexChannel, which is able to shut down input or output individually on demand, but the ALLOW_HALF_CLOSURE channel option has not been supported yet. I thought this could be a missing feature of Unix domain sockets, so here is the PR for it. Modifications: 1. Added allowHalfClosure property both in EpollDomainSocketChannelConfig and KQueueDomainSocketChannelConfig, 2. Enabled isAllowHalfClosure method of native channel to support domain channel config, 3. Created EpollDomainSocketShutdownOutputByPeerTest and KQueueDomainSocketShutdownOutputByPeerTest to verify the change. Result: ALLOW_HALF_CLOSURE channel option can be set with DomainSocketChannel, and no more warning of Unknown channel option 'ALLOW_HALF_CLOSURE'.	2019-03-19 11:24:07 +01:00
Norman Maurer	81e43d5088	DefaultFileRegion.transferTo with invalid count may cause busy-spin (#8885 ) Motivation: `DefaultFileRegion.transferTo` will return 0 all the time when we request more data than the actual file size. This may result in a busy spin while processing the FileRegion during writes. Modifications: - If we wrote 0 bytes check if the underlying file size is smaller than the requested count and if so throw an IOException - Add DefaultFileRegionTest - Add a test to the testsuite Result: Fixes https://github.com/netty/netty/issues/8868.	2019-02-26 11:08:09 +01:00
Norman Maurer	fa6a8cb09c	Support using an Executor to offload blocking / long-running tasks wh… (#8847 ) Motivation: The SSLEngine does provide a way to signal to the caller that it may need to execute a blocking / long-running task which then can be offloaded to an Executor to ensure the I/O thread is not blocked. Currently how we handle this in SslHandler is not really optimal as while we offload to the Executor we still block the I/O Thread. Modifications: - Correctly support offloading the task to the Executor while suspending processing of SSL in the I/O Thread - Add new methods to SslContext to specify the Executor when creating a SslHandler - Remove @deprecated annotations from SslHandler constructor that takes an Executor - Adjust tests to also run with the Executor to ensure all works as expected. Result: Be able to offload long running tasks to an Executor when using SslHandler. Partly fixes https://github.com/netty/netty/issues/7862 and https://github.com/netty/netty/issues/7020.	2019-02-11 09:47:44 +01:00
Norman Maurer	c6a90d90a6	Add more tests to KQueue and Epoll testsuites. (#8851 ) Motivation: We missed to extend a few tests from the testsuite and so also run these with our native KQueue and Epoll transport. Modifications: Extend tests and so run these for our native transports as well. Result: More tests.	2019-02-08 20:08:34 +01:00
Norman Maurer	7f61055cbd	Reduce direct memory overhead per EpollEventLoop when using EpollDatagramChannel (#8825 ) Motivation: When using a linux distribution that supports sendmmsg(...) we allocated enough direct memory per EpollEventLoop to be able to write IOV_MAX number of iovecs per message that can be written per sendmmsg. The number of messages that can be written per sendmmsg(...) call is limited by UIO_MAX_IOV. In practice this resulted in an allocation of 16MB direct memory per EpollEventLoop instance that stayed allocated until the EpollEventLoop was shut down, which happens as part of the shutdown of the enclosing EpollEventLoopGroup. This resulted in heavy direct memory usage even though in practice we have very slim chances to ever need all of the memory. Modification: Adjust NativeDatagramPacketArray to share one IovArray instance across all NativeDatagramPacket instances it holds. This limits the max number of iovecs we can write across all messages to IOV_MAX per sendmmsg(...) call. This in practice will still be enough to allow us to write multiple messages with one syscall while keeping the memory overhead to a minimum. Result: Smaller direct memory footprint per EpollEventLoop when using EpollDatagramChannel on distributions that support sendmmsg(...). Fixes https://github.com/netty/netty/issues/8814	2019-02-02 07:10:02 +01:00
Norman Maurer	7bba4f49cf	Reduce GC produced by native DatagramChannel implementations when in connected mode. (#8806 ) Motivation: In the native code EpollDatagramChannel / KQueueDatagramChannel creates a DatagramSocketAddress object for each received UDP datagram even when in connected mode as it uses the recvfrom(...) / recvmsg(...) method. Creating these is quite heavy in terms of allocations as internally, char[], String, Inet4Address, InetAddressHolder, InetSocketAddressHolder, InetAddress[], byte[] objects are getting generated when constructing the object. When in connected mode we can just use regular read(...) calls which do not need to allocate all of these. Modifications: - When in connected mode use read(...) and NOT recvfrom(..) / readmsg(...) to reduce allocations when possible. - Adjust tests to ensure read works as expected when in connected mode. Result: Less allocations and GC when using native datagram channels in connected mode. Fixes https://github.com/netty/netty/issues/8770.	2019-02-01 10:29:36 +01:00
田欧	a33200ca38	use checkPositive/checkPositiveOrZero (#8803 ) Motivation: We have a utility method to check for > 0 and >0 arguments. We should use it. Modification: use checkPositive/checkPositiveOrZero instead of if statement. Result: Re-use utility method.	2019-01-31 09:07:14 +01:00
Norman Maurer	cd3254df88	Update to new checkstyle plugin (#8777 ) (#8780 ) Motivation: We need to update to a new checkstyle plugin to allow the usage of lambdas. Modifications: - Update to new plugin version. - Fix checkstyle problems. Result: Be able to use checkstyle plugin which supports new Java syntax.	2019-01-25 11:58:42 +01:00
yulianoifa-mobius	1e4481e551	Allowed IP_FREEBIND option for UDP epoll (#8728 ) Motivation: When using load balancers, or when HA support is needed, there are cases where a UDP channel needs to bind to an IP address that is not available on the local network interfaces. Modification: Modified EpollDatagramChannelConfig to allow IP_FREEBIND option Result: Fixes #8727.	2019-01-21 07:42:05 +01:00
Dmitriy Dumanskiy	165912365a	Cleanup: simplify EpollEventLoop.closeAll() (#8719 ) Motivation: Avoid unnecessary iteration and `ArrayList` allocation. Modification: ``` for (AbstractEpollChannel channel: channels.values()) { array.add(channel); } ``` replaced with `array.addAll(channels.values())` and ``` Collection<AbstractEpollChannel> array = new ArrayList<AbstractEpollChannel>(channels.size()); array.addAll(channels.values()) ``` replaced with: `AbstractEpollChannel[] localChannels = channels.values().toArray(new AbstractEpollChannel[0]);` Result: Simpler code in `EpollEventLoop.closeAll();`	2019-01-16 11:00:25 +01:00
Norman Maurer	d4b1202e62	Add testcase for epollWait(...) with negative timerfd values. (#8447 ) Motivation: https://github.com/netty/netty/issues/8444 reports that there is some issue with negative values passed to timerfd_settime. This test verifies that everything is working as expected. Modifications: Add testcase. Result: Test to verify expected behaviour.	2018-10-30 19:38:02 +01:00
Johno Crawford	5b3b8db07f	epoll_wait produces an EINVAL error since 4.1.30 (#8350 ) Motivation: epoll_wait should work in 4.1.30 like it did in 4.1.29. Modifications: Revert Integer.MAX_VALUE back to MAX_SCHEDULED_TIMERFD_NS (999,999,999). Add unit test. Result: epoll_wait will no longer throw EINVAL.	2018-10-12 05:02:41 +02:00
Matteo Merli	3a96e7373b	Added option to do busy-wait on epoll (#8267 ) Motivation: Add an option (through a SelectStrategy return code) to have the Netty event loop thread busy-wait on the epoll. The reason for this change is to avoid the context switch cost that comes when the event loop thread is blocked on the epoll_wait() call. On average, the context switch has a penalty of ~13usec. This benefits both: - The latency when reading from a socket - Scheduling tasks to be executed on the event loop thread. The tradeoff, when enabling this feature, is that the event loop thread will be using 100% cpu, even when inactive. Modification: - Added SelectStrategy option to return BUSY_WAIT - Epoll loop will do an epoll_wait() with no timeout - Use pause instruction to hint to processor that we're in a busy loop Result: When enabled, minimizes impact of context switch in the critical path	2018-09-28 22:52:00 +02:00
Roger	6138541033	Avoid repeating the same field and hiding it (#8335 ) Motivation: The EpollChannelConfig (same for KQueues) and its subclasses repeatedly declare their own channel field which leads to a 3x repetition for each config instance. Given the fields are protected or package-private it's exposing the code to "field hiding" bugs. Modifications: Use the existing protected channel field from the DefaultChannelConfig class and simply cast it when needed. Result: Fixes #8331	2018-09-28 17:37:14 +02:00
Carl Mastrangelo	1dff107de1	Don't re-arm timerfd each epoll_wait (#7816 ) Motivation: The Epoll transport checks to see if there are any scheduled tasks before entering epoll_wait, and resets the timerfd just before. This causes an extra syscall to timerfd_settime before doing any actual work. When scheduled tasks aren't added frequently, or tasks are added with later deadlines, this is unnecessary. Modification: Check the deadline of the peeked task in EpollEventLoop, rather than the delay. If it hasn't changed since last time, don't re-arm the timer Result: About 2us faster on gRPC RTT 50pct latency benchmarks. Before (2 runs for 5 minutes, 1 minute of warmup): ``` 50.0%ile Latency (in nanos): 64267 90.0%ile Latency (in nanos): 72851 95.0%ile Latency (in nanos): 78903 99.0%ile Latency (in nanos): 92327 99.9%ile Latency (in nanos): 119691 100.0%ile Latency (in nanos): 13347327 QPS: 14933 50.0%ile Latency (in nanos): 63907 90.0%ile Latency (in nanos): 73055 95.0%ile Latency (in nanos): 79443 99.0%ile Latency (in nanos): 93739 99.9%ile Latency (in nanos): 123583 100.0%ile Latency (in nanos): 14028287 QPS: 14936 ``` After: ``` 50.0%ile Latency (in nanos): 62123 90.0%ile Latency (in nanos): 70795 95.0%ile Latency (in nanos): 76895 99.0%ile Latency (in nanos): 90887 99.9%ile Latency (in nanos): 117819 100.0%ile Latency (in nanos): 14126591 QPS: 15387 50.0%ile Latency (in nanos): 61021 90.0%ile Latency (in nanos): 70311 95.0%ile Latency (in nanos): 76687 99.0%ile Latency (in nanos): 90887 99.9%ile Latency (in nanos): 119527 100.0%ile Latency (in nanos): 6351615 QPS: 15571 ```	2018-09-11 13:38:38 +02:00
Matteo Merli	2a1596a4e9	Allow to configure socket option SO_BUSY_POLL (#8268 ) Motivation: When using Epoll based transport, allow applications to configure SO_BUSY_POLL socket option: SO_BUSY_POLL (since Linux 3.11) Sets the approximate time in microseconds to busy poll on a blocking receive when there is no data. Increasing this value requires CAP_NET_ADMIN. The default for this option is controlled by the /proc/sys/net/core/busy_read file. The value in the /proc/sys/net/core/busy_poll file determines how long select(2) and poll(2) will busy poll when they operate on sockets with SO_BUSY_POLL set and no events to report are found. In both cases, busy polling will only be done when the socket last received data from a network device that supports this option. While busy polling may improve latency of some applications, care must be taken when using it since this will increase both CPU utilization and power usage. Modification: Added SO_BUSY_POLL socket option Result: Able to configure SO_BUSY_POLL from Netty	2018-09-07 20:50:51 +02:00
Norman Maurer	b73f785631	We should call the UnLoad methods when we detect an error during calling OnLoad (#8237 ) Motivation: We should ensure we call UnLoad when we detect an error during calling OnLoad and previous OnLoad calls were successful. Modifications: Correctly call UnLoad when needed. Result: More correct code and no leaks when an error happens during loading the native lib.	2018-08-30 06:56:42 +02:00
Norman Maurer	54f565ac67	Allow to use native transports when sun.misc.Unsafe is not present on… (#8231 ) * Allow to use native transports when sun.misc.Unsafe is not present on the system Motivation: We should be able to use the native transports (epoll / kqueue) even when sun.misc.Unsafe is not present on the system. This is especially important as Java11 will be released soon and does not allow access to it by default. Modifications: - Correctly disable usage of sun.misc.Unsafe when -PnoUnsafe is used while running the build - Correctly increment metric when UnpooledDirectByteBuf is allocated. This was uncovered once -PnoUnsafe usage was fixed. - Implement fallbacks in all our native transport code for when sun.misc.Unsafe is not present. Result: Fixes https://github.com/netty/netty/issues/8229.	2018-08-29 19:36:33 +02:00
Norman Maurer	ea4c315b45	Ensure multiple shaded version of the same netty artifact can be loaded as long as the shaded prefix is different (#8207 ) Motivation: We should support to load multiple shaded versions of the same netty artifact as netty is often used in multiple dependencies. This is related to https://github.com/netty/netty/issues/7272. Modifications: - Use -fvisibility=hidden when compiling and use JNIEXPORT for things we really want to have exported - Ensure fields are declared as static so these are not exported - Adjust testsuite-shading to use install_name_tool on MacOS to change the id of the lib. Otherwise the wrong may be used. Result: Be able to use multiple shaded versions of the same netty artifact.	2018-08-21 07:53:45 +02:00
Ziyan Mo	785473788f	(Nio\|Epoll)EventLoop.pendingTasks does not need to dispatch to the EventLoop (#8197 ) Motivation: EventLoop.pendingTasks should be (reasonably) cheap to invoke so it can be used within observability. Modifications: Remove code that dispatch access to the internal taskqueue to the EventLoop when invoked as this is not needed anymore with the current MPSC queues we are using. See https://github.com/netty/netty/issues/8196#issuecomment-413653286. Result: Fixes https://github.com/netty/netty/issues/8196	2018-08-18 07:28:31 +02:00
Nick Hill	630c82717d	Lazy initialize NativeDatagramPacketArray and IovArray in EpollEventLoop (#8160 ) Motivation: Avoid unnecessary native memory allocation if UDP / TCP isn't being used. Modifications: Create the reused NativeDatagramPacketArray and IovArray upon first use instead of EpollEventLoop construction. Also correct related comment in NativeDatagramPacketArray. Result: Reduced native memory use when using epoll in many cases	2018-07-29 18:22:27 +08:00
Norman Maurer	05e5ab1ecb	Store NativeDatagramPacketArray directly in the EpollEventLoop Motivation: We can store the NativeDatagramPacketArray directly in the EpollEventLoop. This removes the need of using FastThreadLocal. Modifications: - Store NativeDatagramPacketArray directly in the EpollEventLoop (just as we do with IovArray as well). Result: Less FastThreadLocal usage and more consistent code.	2018-06-27 20:52:28 +02:00
Scott Mitchell	12f6500a4f	Epoll and Kqueue shouldn't read by default (#8024 ) Motivation: Epoll and Kqueue channels have internal state which forces a single read operation after channel construction. This violates the Channel#read() interface which indicates that data shouldn't be delivered until this method is called. The behavior is also inconsistent with the NIO transport. Modifications: - Epoll and Kqueue shouldn't unconditionally read upon initialization, and instead should rely upon Channel#read() or auto_read. Result: Epoll and Kqueue are more consistent with NIO.	2018-06-15 10:28:50 +02:00
Norman Maurer	d133bf06a4	Allow to schedule tasks up to Long.MAX_VALUE (#7972 ) Motivation: We should allow scheduling tasks with a delay up to Long.MAX_VALUE as we did pre 4.1.25.Final. Modifications: Just ensure we do not overflow and put the correct max limits in place when scheduling a timer. At worst we will get a wakeup too early and then schedule a new timeout. Result: Fixes https://github.com/netty/netty/issues/7970.	2018-05-30 11:11:42 +02:00
Norman Maurer	030318e53c	Read until all data is consumed when EOF is detected even if readPend… (#7961 ) * Read until all data is consumed when EOF is detected even if readPending is false and auto-read is disabled. Motivation: We should better always notify the user of EOF even if the user did not request any data as otherwise we may never be notified when the remote peer closes the connection. This should be ok as the amount of extra data we may read and so fire through the pipeline is limited by SO_RECVBUF. Modifications: - Always drain the socket when EOF is detected. - Add testcase Result: No risk for the user to be not notified of EOF.	2018-05-24 20:29:29 +02:00
Norman Maurer	358249e5c9	Allow to disable native transport and native ssl support via system property. (#7903 ) Motivation: Sometimes it's useful to disable native transports / native ssl to debug a problem. We should allow to do so with a system property so people not need to adjust code for this. Modifications: Add system properties which allow to disable native transport and native ssl. Result: Easier to disable native code usage without code changes.	2018-05-04 14:44:44 +02:00
Devrim Şahin	b818852cdb	Update DatagramPacket.recipient() to return the actual destination IP (#7879 ) Motivation: DatagramPacket.recipient() doesn't return the actual destination IP, but the IP the app is bound to. Modification: - IP_RECVORIGDSTADDR option is enabled for UDP sockets, which allows retrieval of ancillary information containing the original recipient. - _recvFrom(...) function from transport-native-unix-common/src/main/c/netty_unix_socket.c is modified such that if IP_RECVORIGDSTADDR is set, recvmsg is used instead of recvfrom; enabling the retrieval of the original recipient. - DatagramSocketAddress also contains a 'local' address, representing the recipient. - EpollDatagramChannel is updated to return the retrieved recipient address instead of the address the channel is bound to. Result: Fixes #4950.	2018-04-26 08:00:36 +02:00
Norman Maurer	b47fb81799	EventLoop.schedule with big delay fails (#7402 ) Motivation: Using a very huge delay when calling schedule(...) may cause a Selector error when calling select(...) later on. We should guard against such a big value. Modifications: - Add guard against a very huge value. - Added tests. Result: Fixes [#7365]	2018-04-24 11:15:20 +02:00
Devrim Şahin	0b690a991f	Added IP_TRANSPARENT option for UDP (#7872 ) Motivation: This allows netty to operate in 'transparent proxy' mode for UDP, intercepting connections to other addresses by means of Linux firewalling rules, as per https://www.kernel.org/doc/Documentation/networking/tproxy.txt Modification: Add IP_TRANSPARENT option. Result: Allows setting and getting of the IP_TRANSPARENT option, which allows retrieval of the ultimate socket address originally requested.	2018-04-17 09:07:02 +02:00
Scott Mitchell	ed0668384b	NIO read spin event loop spin when half closed (#7801 ) Motivation: AbstractNioByteChannel will detect that the remote end of the socket has been closed and propagate a user event through the pipeline. However if the user has auto read on, or calls read again, we may propagate the same user events again. If the underlying transport continuously notifies us that there is read activity this will happen in a spin loop which consumes unnecessary CPU. Modifications: - AbstractNioByteChannel's unsafe read() should check if the input side of the socket has been shutdown before processing the event. This is consistent with EPOLL and KQUEUE transports. - add unit test with @normanmaurer's help, and make transports consistent with respect to user events Result: No more read spin loop in NIO when the channel is half closed.	2018-03-28 20:02:57 +02:00
Norman Maurer	0a8e1aaf19	Flush task should not flush messages that were written since last flush attempt. Motivation: The flush task is currently using flush(), which has the effect of making the flush traverse the whole ChannelPipeline and also flush messages that were written since we gave up flushing. This is not really correct as we should only continue to flush messages that were flushed at the point in time when the flush task was submitted for execution, unless the user explicitly calls flush(). Modification: Call *Unsafe.flush0() via the flush task which will only continue flushing messages that were marked as flushed before. Result: More correct behaviour when the flush task is used.	2018-03-02 10:09:40 +09:00
Scott Mitchell	ce241bd11e	Epoll flush/writabilityChange deadlock Motivation: `b215794de3` recently introduced a change in behavior where writeSpinCount provided a limit for how many write operations were attempted per flush operation. However when the write quantum was met the selector write flag was not cleared, and the channel unsafe flush0 method has an optimization which prematurely exits if the write flag is set. This may lead to no write progress being made under the following scenario: - flush is called, but the socket can't accept all data, we set the write flag - the selector wakes us up because the socket is writable, we write data and use the writeSpinCount quantum - we then schedule a flush() on the EventLoop to execute later, however the flush0 optimization prematurely exits because the write flag is still set In this scenario the socket is still writable so the EventLoop may never notify us that the socket is writable, and therefore we may never attempt to flush data to the OS. Modifications: - When the writeSpinCount quantum is exceeded we should clear the selector write flag Result: Fixes https://github.com/netty/netty/issues/7729	2018-02-20 11:40:58 +01:00
Scott Mitchell	33ddb83dc1	IovArray#add return value resulted in more ByteBufs being added during iteration Motivation: IovArray implements MessageProcessor, and the processMessage method will continue to be called during iteration until it returns true. A recent commit `b215794de3` changed the return value to only return true if any component of a CompositeByteBuf was added as a result of the method call. However this results in the iteration continuing, and potentially subsequent smaller buffers maybe added, which will result in out of order writes and generally corrupts data. Modifications: - IovArray#add should return false so that the MessageProcessor#processMessage will stop iterating. Result: Native transports which use IovArray will not corrupt data during gathering writes of CompositeByteBuf objects.	2018-01-04 08:04:32 -08:00
Scott Mitchell	af2f343648	FileDescriptor writev core dump Motivation: FileDescriptor#writev calls JNI code, and that JNI code dereferences a NULL pointer which crashes the application. This occurs when writing a single CompositeByteBuf object with more than one component. Modifications: - Initialize the iovec iterator properly to avoid the core dump - Fix the array length calculation if we aren't able to fit all the ByteBuffer objects in the iovec array Result: No more core dump.	2017-12-14 16:47:31 -08:00
Scott Mitchell	b215794de3	Enforce writeSpinCount to limit resource consumption per socket (#7478 ) Motivation: The writeSpinCount currently loops over the same buffer, gathering write, file write, or other write operation multiple times but will continue writing until there is nothing left or the OS doesn't accept any data for that specific write. However if the OS keeps accepting writes there is no way to limit how much time we spend on a specific socket. This can lead to unfair consumption of resources dedicated to a single socket. We currently don't limit the amount of bytes we attempt to write per gathering write. If there are many more bytes pending relative to the SO_SNDBUF size we will end up building iov arrays with more elements than can be written, which results in extra iteration, conditionals, and book keeping. Modifications: - writeSpinCount should limit the number of system calls we make to write data, instead of applying to individual write operations - IovArray should support a maximum number of bytes - IovArray should support composite buffers of greater than size 1024 - We should auto-scale the amount of data that we attempt to write per gathering write operation relative to SO_SNDBUF and how much data is successfully written - The non-unsafe path should also support a maximum number of bytes, and respect the IOV_MAX limit Result: Write resource consumption can be bounded and gathering writes have a limit relative to the amount of data which can actually be accepted by the socket.	2017-12-07 16:00:52 -08:00
Norman Maurer	3f101caa4c	Not call java methods from within JNI init code to prevent class loading deadlocks. Motivation: We used NetUtil.isIpV4StackPreferred() when loading JNI code which tries to load NetworkInterface in its static initializer. Unfortunately a lock on the NetworkInterface class init may already be held somewhere else which may cause a loader deadlock. Modifications: Add a new Socket.initialize() method that will be called when initializing the library and pass everything needed to the JNI level so we do not need to call back to java. Result: Fixes [#7458].	2017-12-06 14:34:15 +01:00
Norman Maurer	251bb1a739	Not use safeRelease(...) but release(...) to release non-readable holders to ensure we not mask errors. Motivation: AbstractChannel attempts to "filter" messages which are written [1]. A goal of this process is to copy from heap to direct if necessary. However implementations of this method [2][3] may translate a buffer with 0 readable bytes to EMPTY_BUFFER. This may mask a user error where an empty buffer is written but already released. Modifications: Replace safeRelease(...) with release(...) to ensure we propagate reference count issues. Result: Fixes [#7383]	2017-12-04 20:38:35 +01:00
Norman Maurer	b1cc5835ac	Not directly call getsockopt but use exported helper function Motivation: To better isolate OS system calls we should not call getsockopt directly but use our netty_unix_socket_getOption0 function. This is a follow-up to `f115bf5`. Modifications: Export netty_unix_socket_getOption0 by declaring it in the header file and use it Result: Better isolation of system calls.	2017-11-22 08:40:44 +01:00
Norman Maurer	f115bf50cb	EpollSocketChannelConfig.getOptions() must not throw if TCP_FASTOPEN_CONNECT is not supported Motivation: If a user calls EpollSocketChannelConfig.getOptions() and TCP_FASTOPEN_CONNECT is not supported we throw an exception. Modifications: - Just return 0 if ENOPROTOOPT is set. - Add testcase Result: getOptions() works as expected.	2017-11-16 14:07:54 +01:00
Norman Maurer	e7f02b1dc0	Set readPending to false when EOF is detected while issuing a read Motivation: We need to set readPending to false when we detect an EOF while issuing a read as otherwise we may not unregister from the Selector / Epoll / KQueue and so keep on receiving wakeups. The important bit is that we may even get a wakeup for a read event but will still only be able to read 0 bytes from the socket, so we need to be very careful when we clear the readPending. This can happen because we generally use edge-triggered mode for our native transports and because of the nature of edge-triggered we may schedule a read event just to find out there is nothing left to read atm (because we completely drained the socket on the previous read). Modifications: Set readPending to false when EOF is detected. Result: Fixes [#7255].	2017-11-06 15:44:36 -08:00
Norman Maurer	bcad9dbf97	Revert "Set readPending to false when ever a read is done" This reverts commit `413c7c2cd8` as it introduced an regression when edge-triggered mode is used which is true for our native transports by default. With `413c7c2cd8` included it was possible that we set readPending to false by mistake even if we would be interested in read more.	2017-11-06 09:21:42 -08:00
Janusz Dziemidowicz	cdb2a27857	Add TCP_FASTOPEN_CONNECT epoll option Motivation: Linux kernel 4.11 introduced a new socket option, TCP_FASTOPEN_CONNECT, that greatly simplifies making TCP Fast Open connections on client side. Usually simply setting the flag before connect() call is enough, no more changes are required. Details can be found in kernel commit: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=19f6d3f3 Modifications: TCP_FASTOPEN_CONNECT socket option was added to EpollChannelOption class. Result: Netty clients can easily make TCP Fast Open connections. Simply calling option(EpollChannelOption.TCP_FASTOPEN_CONNECT, true) in client bootstrap is enough (given recent enough kernel).	2017-10-29 13:42:15 +01:00
Scott Mitchell	413c7c2cd8	Set readPending to false when ever a read is done Motivation: readPending is currently only set to false if data is delivered to the application, however this may result in duplicate events being received from the selector in the event that the socket was closed. Modifications: - We should set readPending to false before each read attempt for all transports besides NIO. - Based upon the Javadocs it is possible that NIO may have spurious wakeups [1]. In this case we should be more cautious and only set readPending to false if data was actually read. [1] https://docs.oracle.com/javase/7/docs/api/java/nio/channels/SelectionKey.html That a selection key's ready set indicates that its channel is ready for some operation category is a hint, but not a guarantee, that an operation in such a category may be performed by a thread without causing the thread to block. Result: Notification from the selector (or simulated events from kqueue/epoll ET) in the event of socket closure. Fixes https://github.com/netty/netty/issues/7255	2017-10-25 08:25:54 -07:00
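
The `closeAll()` simplification described in `165912365a` can be sketched outside Netty as follows. This is a minimal illustration only: `FakeChannel`, `register` and `CloseAllSketch` are hypothetical stand-ins and not Netty classes, and the assumption that closing a channel removes it from the registry map is made here to show why a snapshot array is convenient.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for an event loop's channel registry; not Netty code.
class CloseAllSketch {

    // Hypothetical channel that deregisters itself from the registry when closed.
    static final class FakeChannel {
        final int fd;
        final Map<Integer, FakeChannel> registry;
        boolean open = true;

        FakeChannel(int fd, Map<Integer, FakeChannel> registry) {
            this.fd = fd;
            this.registry = registry;
        }

        void close() {
            open = false;
            registry.remove(fd); // mutates the map that closeAll() walks over
        }
    }

    // Creates a channel and stores it in the registry under its descriptor.
    static FakeChannel register(Map<Integer, FakeChannel> channels, int fd) {
        FakeChannel ch = new FakeChannel(fd, channels);
        channels.put(fd, ch);
        return ch;
    }

    // Closes every registered channel and returns how many were closed.
    static int closeAll(Map<Integer, FakeChannel> channels) {
        // Snapshot the values in a single call instead of copying them one by one
        // into an ArrayList; iterating the array is safe while close() removes
        // entries from the map.
        FakeChannel[] localChannels = channels.values().toArray(new FakeChannel[0]);
        for (FakeChannel ch : localChannels) {
            ch.close();
        }
        return localChannels.length;
    }

    public static void main(String[] args) {
        Map<Integer, FakeChannel> channels = new LinkedHashMap<>();
        for (int fd = 3; fd < 6; fd++) {
            register(channels, fd);
        }
        int closed = closeAll(channels);
        System.out.println("closed=" + closed + " remaining=" + channels.size());
    }
}
```

Iterating `channels.values()` directly in this sketch would fail with a `ConcurrentModificationException` once `close()` removes the first entry, which is why the values are copied before the loop.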


315 Commits