netty5

Author	SHA1	Message	Date
Scott Mitchell	9572868e57	Fix Native EPOLL Build Failure Motiviation: If sendmmsg is already defined then the native epoll module failed to build because of conflicting definitions. The mmsghdr type was also redefined on systems that already supported this structure. Modifications: Provide a way so that systems which already define sendmmsg and mmsghdr can build Provide a way so that systems which don't define sendmmsg and mmsghdr can build Result: The native EPOLL module can build in more environments	2014-09-17 20:59:21 +02:00
Norman Maurer	3cf65e3987	Directly write CompositeByteBuf if possible without memory copy. Related to [#2719 ] Motivation: In linux it is possible to write more then one buffer withone syscall when sending datagram messages. Modifications: Not copy CompositeByteBuf if it only contains direct buffers. Result: More performance due less overhead for copy.	2014-09-10 14:33:05 +02:00
Norman Maurer	9428d51e2f	Add support for sendmmsg(...) and so allow to write multiple DatagramPackets with one syscall. Related to [#2719 ] Motivation: On linux with glibc >= 2.14 it is possible to send multiple DatagramPackets with one syscall. This can be a huge performance win and so we should support it in our native transport. Modification: - Add support for sendmmsg by reuse IovArray - Factor out ThreadLocal support of IovArray to IovArrayThreadLocal for better separation as we use IovArray also without ThreadLocal in NativeDatagramPacketArray now - Introduce NativeDatagramPacketArray which is used for sendmmsg(...) - Implement sendmmsg(...) via jni - Expand DatagramUnicastTest to test also sendmmsg(...) Result: Netty now automatically use sendmmsg(...) if it is supported and we have more then 1 DatagramPacket in the ChannelOutboundBuffer and flush() is called.	2014-09-09 09:49:53 +02:00
Norman Maurer	646753c2eb	Allow to write CompositeByteBuf directly via EpollDatagramChannel. Related to [#2719 ] Motivation: On linux it is possible to use the sendMsg(...) system call to write multiple buffers with one system call when using datagram/udp. Modifications: - Implement the needed changes and make use of sendMsg(...) if possible for max performance - Add tests that test sending datagram packets with all kind of different ByteBuf implementations. Result: Performance improvement when using CompoisteByteBuf and EpollDatagramChannel.	2014-09-09 09:45:03 +02:00
Norman Maurer	cb1bf1a74e	[#2867 ] Workaround performance issue with IPv4-mapped-on-IPv6 addresses Motivation: InetAddress.getByName(...) uses exceptions for control flow when try to parse IPv4-mapped-on-IPv6 addresses. This is quite expensive. Modifications: Detect IPv4-mapped-on-IPv6 addresses in the JNI level and convert to IPv4 addresses before pass to InetAddress.getByName(...) (via InetSocketAddress constructor). Result: Eliminate performance problem causes by exception creation when parsing IPv4-mapped-on-IPv6 addresses.	2014-09-09 07:23:53 +02:00
Norman Maurer	9b99a611ad	[#2823 ] Writing DefaultFileRegion with EpollSocketChannel may cause hang Motivation: In EpollSocketchannel.doWriteFileRegion(...) we need to make sure we write until sendFile(...) returns either 0 or all is written. Otherwise we may not get notified once the Channel is writable again. This is the case as we use EPOLL_ET. Modifications: Always write until either sendFile returns 0 or all is written. Result: No more hangs when writing DefaultFileRegion can happen.	2014-08-26 15:09:09 +02:00
Norman Maurer	1504d2044d	Allow efficient writing of CompositeByteBuf when using native epoll transport. Motivation: There were no way to efficient write a CompositeByteBuf as we always did a memory copy to a direct buffer in this case. This is not needed as we can just write a CompositeByteBuf as long as all the components are buffers with a memory address. Modifications: - Write CompositeByteBuf which contains only direct buffers without memory copy - Also handle CompositeByteBuf that have more components then 1024. Result: More efficient writing of CompositeByteBuf.	2014-08-21 10:57:45 +02:00
Trustin Lee	b5f61d0de5	[maven-release-plugin] prepare for next development iteration	2014-08-16 03:27:42 +09:00
Trustin Lee	76ac3b21a5	[maven-release-plugin] prepare release netty-4.1.0.Beta3	2014-08-16 03:27:37 +09:00
Trustin Lee	b3c1904cc9	[maven-release-plugin] prepare for next development iteration	2014-08-15 09:31:03 +09:00
Trustin Lee	e013b2400f	[maven-release-plugin] prepare release netty-4.1.0.Beta2	2014-08-15 09:30:59 +09:00
Trustin Lee	061d5bc261	Fix data corruption in FileRegion transfer with epoll transport Related issue: #2764 Motivation: EpollSocketChannel.writeFileRegion() does not handle the case where the position of a FileRegion is non-zero properly. Modifications: - Improve SocketFileRegionTest so that it tests the cases where the file transfer begins from the middle of the file - Add another jlong parameter named 'base_off' so that we can take the position of a FileRegion into account Result: Improved test passes. Corruption is gone.	2014-08-13 16:58:28 -07:00
Norman Maurer	286b89933c	Allow to obtain RecvByteBufAllocator.Handle to allow more flexible implementations Motivation: At the moment it's only possible for a user to set the RecvByteBufAllocator for a Channel but not access the Handle once it is assigned. This makes it hard to write more flexible implementations. Modifications: Add a new method to the Channel.Unsafe to allow access the the used Handle for the Channel. The RecvByteBufAllocator.Handle is created lazily. Result: It's possible to write more flexible implementatons that allow to adjust stuff on the fly for a Handle that is used by a Channel	2014-08-12 06:53:57 +02:00
Norman Maurer	869687bd71	Port ChannelOutboundBuffer and related changes from 4.0 Motivation: We did various changes related to the ChannelOutboundBuffer in 4.0 branch. This commit port all of them over and so make sure our branches are synced in terms of these changes. Related to [#2734], [#2709], [#2729], [#2710] and [#2693] . Modification: Port all changes that was done on the ChannelOutboundBuffer. This includes the port of the following commits: - `73dfd7c01b` - `997d8c32d2` - `e282e504f1` - `5e5d1a58fd` - `8ee3575e72` - `d6f0d12a86` - `16e50765d1` - `3f3e66c31a` Result: - Less memory usage by ChannelOutboundBuffer - Same code as in 4.0 branch - Make it possible to use ChannelOutboundBuffer with Channel implementation that not extends AbstractChannel	2014-08-05 15:00:45 +02:00
Trustin Lee	cc33417c0e	Add more utility methods to check the availability of the epoll transport Related issue: #2733 Motivation: Unlike OpenSsl, Epoll lacks a couple useful availability checker methods: - ensureAvailability() - unavailabilityCause() Modifications: Add missing methods Result: More ways to check the availability and to get the cause of unavailability programatically.	2014-08-04 15:05:05 -07:00
Norman Maurer	3207fac88e	Use correct exception message when throw exception from native code Motivation: We sometimes not use the correct exception message when throw it from the native code. Modifications: Fixed the message. Result: Correct message in exception	2014-07-28 13:33:54 -07:00
Norman Maurer	f88cd62354	[#2692 ] Allows notify ChannelFutureProgressListener on complete writes Motivation: We have some inconsistency when handling writes. Sometimes we call ChannelOutboundBuffer.progress(...) also for complete writes and sometimes not. We should call it always. Modifications: Correctly call ChannelOuboundBuffer.progress(...) for complete and incomplete writes. Result: Consistent behavior	2014-07-28 04:19:23 -07:00
Norman Maurer	d5b7c131dd	Correctly write single ByteBuf with memoryAddress Motivation: While optimize gathering writes I introduced a bug when writing single ByteBuf that have a memoryAddress. This regression was introduced by `88bd6e7a93`. Modifications: Correctly use the writerIndex as argument when call Native.writeAddress(...) Result: No more corruption while write single buffers.	2014-07-25 17:27:22 +02:00
Norman Maurer	88bd6e7a93	Optimize native transport for gathering writes Motivation: While benchmarking the native transport with gathering writes I noticed that it is quite slow. This is due the fact that we need to do a lot of array copies to get the buffers into the iov array. Modification: Introduce a new class calles IovArray which allows to fill buffers directly in a iov array that can be passed over to JNI without any array copies. This gives a nice optimization in terms of speed when doing gathering writes. Result: Big performance improvement when doing gathering writes. See the included benchmark... Before: [nmaurer@xxx]~% wrk/wrk -H 'Host: localhost' -H 'Accept: text/html,application/xhtml+xml,application/xml;q=0.9,/;q=0.8' -H 'Connection: keep-alive' -d 120 -c 256 -t 16 --pipeline 256 http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 23.44ms 16.37ms 259.57ms 91.77% Req/Sec 181.99k 31.69k 304.60k 78.12% 346544071 requests in 2.00m, 46.48GB read Requests/sec: 2887885.09 Transfer/sec: 396.59MB With this change: [nmaurer@xxx]~% wrk/wrk -H 'Host: localhost' -H 'Accept: text/html,application/xhtml+xml,application/xml;q=0.9,/;q=0.8' -H 'Connection: keep-alive' -d 120 -c 256 -t 16 --pipeline 256 http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 21.93ms 16.33ms 305.73ms 92.34% Req/Sec 194.56k 33.75k 309.33k 77.04% 369617503 requests in 2.00m, 49.57GB read Requests/sec: 3080169.65 Transfer/sec: 423.00MB	2014-07-25 09:55:02 +02:00
Norman Maurer	512d1a11ff	[#2685 ] Epoll transport should use GetPrimitiveArrayCritical / ReleasePrimitiveArrayCritical Motivation: At the moment we use GetArrayElement all the time in the epoll transport which may be wasteful as the JVM may do a memory copy for this. For code-path that will get executed fast (without blocking) we should better make use of GetPrimitiveArrayCritical and ReleasePrimitiveArrayCritical as this signal the JVM that we not want to do any memory copy if not really needed. It is important to only do this on non-blocking code-path as this may even suspend the GC to disallow the JVM to move the arrays around. See also http://docs.oracle.com/javase/7/docs/technotes/guides/jni/spec/functions.html#GetPrimitiveArrayCritical Modification: Make use of GetPrimitiveArrayCritical / ReleasePrimitiveArrayCritical as replacement for GetArrayElement / Release*ArrayElement where possible. Result: Better performance due less memory copies.	2014-07-21 07:11:37 +02:00
Norman Maurer	695fbc9140	[#2684 ] EpollSocketChannel gathering writes should take fast-path if possible Motivation: In EpollSocketchannel.writeBytesMultiple(...) we loop over all buffers to see if we need to adjust the readerIndex for incomplete writes. We can skip this if we know that everything was written (a.k.a complete write). Modification: Use fast-path if all bytes are written and so no need to loop over buffers Result: Fast write path for the average use.	2014-07-21 06:49:13 +02:00
Norman Maurer	9939c00541	Use the correct memoryAddress size when do a gathering write. Part of [#2680 ]	2014-07-21 06:27:30 +02:00
Norman Maurer	faf9ac9a30	[#2680 ] NioSocketChannelOutboundBuffer.nioBuffers() / EpollSocketChannelOutboundBuffer.memoryAddresses() should always return non-null array as stated in javadocs Motivation: At the moment NioSocketChannelOutboundBuffer.nioBuffers() / EpollSocketChannelOutboundBuffer.memoryAddresses() returns null if something is contained in the ChannelOutboundBuffer which is not a ByteBuf. This is a problem for two reasons: 1 - In the javadocs we state that it will never return null 2 - We may do a not optimal write as there may be things that could be written via gathering writes Modifications: Change NioSocketChannelOutboundBuffer.nioBuffers() / EpollSocketChannelOutboundBuffer.memoryAddresses() to never return null but have it contain all ByteBuffer that were found before the non ByteBuf. This way we can do a gathering write and also conform to the javadocs. Result: Better speed and also correct implementation in terms of the api.	2014-07-20 19:57:01 +02:00
Norman Maurer	aa66f556e5	[#2667 ] Write until EAGAIN in native transport and only call setEpollOut() in this case Motivation: In the previous fix for #2667 I did introduce a bit overhead by calling setEpollOut() too often. Modification: Only call setEpollOut() if really needed and remove unused code. Result: Less overhead when saturate network.	2014-07-18 20:44:37 +02:00
Norman Maurer	7e61538790	[#2665 ] Continue writing on IOException when using DatagramChannel Motivation: As a DatagramChannel supports to write to multiple remote peers we must not close the Channel once a IOException accours as this error may be only valid for one remote peer. Modification: Continue writing on IOException. Result: DatagramChannel can be used even after an IOException accours during writing.	2014-07-18 12:49:34 +02:00
Norman Maurer	14096d85cd	[#2667 ] Write until EAGAIN in native transport Motivation: We need to continue write until we hit EAGAIN to make sure we not see an starvation Modification: Write until EAGAIN is returned Result: No starvation when using native transport with ET.	2014-07-18 11:25:37 +02:00
Norman Maurer	89593a719b	[#2666 ] Fix possible NPE when try to fullfill connect ChannelPromise Motivation: Because of a missing return statement we may produce a NPE when try to fullfill the connect ChannelPromise when it was fullfilled before. Modification: Add missing return statement. Result: No more NPE.	2014-07-18 07:10:11 +02:00
Norman Maurer	4db6c65534	[#2647 ] Handle IOV_MAX in java code Motivation: The handling of IOV_MAX was done in JNI code base which makes stuff really complicated to maintain etc. Modifications: Move handling of IOV_MAX to java code to simplify stuff Result: Cleaner code.	2014-07-18 06:59:12 +02:00
Norman Maurer	89261fd8df	[#2664 ] Support write-spinning also in native transport Motivation: In our nio implementation we use write-spinning for maximize throughput, but in the native implementation this is not used. Modification: Respect writeSpinCount in native transport. Result: Better throughput	2014-07-18 06:33:16 +02:00
Norman Maurer	62bbd4220a	Fix JVM segfault during JNI call. Part of [#2647 ] Motivation: Currently when Native.writev(...) is used it is possible to see a JVM segfault because the offset is updated to early. Modification: Only update the offset once it is safe to do so. Result: No more segfault	2014-07-09 13:34:15 +02:00
Norman Maurer	fb22d34925	[#2647 ] Respect IOV_MAX when call writev in native transport Motivation: epoll transport fails on gathering write of more then 1024 buffers. As linux supports max. 1024 iov entries when calling writev(...) the epoll transport throws an exception. Thanks again to @blucas to provide me with a reproducer and so helped me to understand what the issue is. Modifications: Make sure we break down the writes if to many buffers are uses for gathering writes. Result: Gathering writes work with any number of buffers	2014-07-09 12:21:58 +02:00
Trustin Lee	e167b02d52	[maven-release-plugin] prepare for next development iteration	2014-07-04 17:26:02 +09:00
Trustin Lee	ba50cb829b	[maven-release-plugin] prepare release netty-4.1.0.Beta1	2014-07-04 17:25:54 +09:00
Trustin Lee	787663a644	[maven-release-plugin] rollback the release of netty-4.1.0.Beta1	2014-07-04 17:11:14 +09:00
Trustin Lee	83eae705e1	[maven-release-plugin] prepare release netty-4.1.0.Beta1	2014-07-04 17:02:17 +09:00
Trustin Lee	d0912f2709	Fix most inspector warnings Motivation: It's good to minimize potentially broken windows. Modifications: Fix most inspector warnings from our profile Update IntObjectHashMap Result: Cleaner code	2014-07-02 19:55:07 +09:00
Norman Maurer	7a05a617f6	[#2558 ] Define SO_REUSEPORT if not defined Motivation: Currently it is impossible to build netty on linux system that not define SO_REUSEPORT even if it is supported. Modification: Define SO_REUSEPORT if not defined. Result: Possible to build on more linux dists.	2014-07-02 09:40:10 +02:00
Norman Maurer	36b80c25f7	Correctly return from selector loop one a scheduled task is ready for processing Motivation: We use the nanoTime of the scheduledTasks to calculate the milli-seconds to wait for a select operation to select something. Once these elapsed we check if there was something selected or some task is ready for processing. Unfortunally we not take into account scheduled tasks here so the selection loop will continue if only scheduled tasks are ready for processing. This will delay the execution of these tasks. Modification: - Check if a scheduled task is ready after selecting - also make a tiny change in NioEventLoop to not trigger a rebuild if nothing was selected because the timeout was reached a few times in a row. Result: Execute scheduled tasks on time.	2014-07-02 09:10:37 +02:00
Norman Maurer	23840b76d7	[#2623 ] Release local references to guard against StackOverflow in JNI Motivation: When we do a (env)->GetObjectArrayElement(...) call we may created many local references which will only be cleaned up once we exist the native method. Thus a lot of memory can be used and so a StackOverFlow may be triggered. Beside this the JNI specification only say that an implementation must cope with 16 local references. Modification: Call (env)->ReleaseLocalRef(...) to release the resource once not needed anymore. Result: Less memory usage and guard against StackOverflow	2014-06-30 09:57:13 +02:00
Norman Maurer	776f3aeb3d	Let EpollReuseAddrTest also work with kernel versions that not have bugfix release part	2014-06-27 17:56:17 +02:00
Norman Maurer	76cd3a37e5	[#2598 ] Add Epoll.isAvailable() which allows to check if epoll can be used. Motivation: At the moment there is no simple way for a user to check if the native epoll transport can be used on the running platform. Thus the user can only try to instance it and catch any exception and fallback to nio transport. Modification: Add Epoll.isAvailable() which allows to check if epoll can be used. Result: User can easily check if epoll transport can be used or not	2014-06-26 12:27:09 +02:00
Norman Maurer	b0e5c9be00	[#2605 ] Use SO_REUSEADDR on EpollServerSocketChannel to match defaults of java.nio.ServerSocketChannel impl Motivation: When using openjdk and oracle jdk's nio (while using the nio transport) the ServerSocketChannel uses SO_REUSEADDR by default. Our native transport should do the same to make it easier to switch between the different implementations and get the expected result. Modification: Change EpollServerSocketChannelConfig to set SO_REUSEADDR on the created socket. Result: SO_REUSEADDR is used by default on servers.	2014-06-26 11:57:40 +02:00
Norman Maurer	31211487e9	Use IntObjectMap to replace Map in EpollEventLoop. Motivation: We need to map from ints to AbstractEpollChannel in EpollEventLoop but there is no need for box to Integer. Modification: Replace Map with IntObjectMap. Result: No more auto-boxing needed.	2014-06-25 20:22:58 +02:00
Trustin Lee	e99f26fe09	Make sure OpenSslEngine is tested against transport-native-epoll	2014-06-21 18:28:54 +09:00
Trustin Lee	b6c0c0c95f	Add an OpenSslEngine and the universal API for enabling SSL Motivation: Some users already use an SSLEngine implementation in finagle-native. It wraps OpenSSL to get higher SSL performance. However, to take advantage of it, finagle-native must be compiled manually, and it means we cannot pull it in as a dependency and thus we cannot test our SslHandler against the OpenSSL-based SSLEngine. For an instance, we had #2216. Because the construction procedures of JDK SSLEngine and OpenSslEngine are very different from each other, we also need to provide a universal way to enable SSL in a Netty application. Modifications: - Pull netty-tcnative in as an optional dependency. http://netty.io/wiki/forked-tomcat-native.html - Backport NativeLibraryLoader from 4.0 - Move OpenSSL-based SSLEngine implementation into our code base. - Copied from finagle-native; originally written by @jpinner et al. - Overall cleanup by @trustin. - Run all SslHandler tests with both default SSLEngine and OpenSslEngine - Add a unified API for creating an SSL context - SslContext allows you to create a new SSLEngine or a new SslHandler with your PKCS#8 key and X.509 certificate chain. - Add JdkSslContext and its subclasses - Add OpenSslServerContext - Add ApplicationProtocolSelector to ensure the future support for NPN (NextProtoNego) and ALPN (Application Layer Protocol Negotiation) on the client-side. - Add SimpleTrustManagerFactory to help a user write a TrustManagerFactory easily, which should be useful for those who need to write an alternative verification mechanism. For example, we can use it to implement an unsafe TrustManagerFactory that accepts self-signed certificates for testing purposes. - Add InsecureTrustManagerFactory and FingerprintTrustManager for quick and dirty testing - Add SelfSignedCertificate class which generates a self-signed X.509 certificate very easily. - Update all our examples to use SslContext.newClient/ServerContext() - SslHandler now logs the chosen cipher suite when handshake is finished. Result: - Cleaner unified API for configuring an SSL client and an SSL server regardless of its internal implementation. - When native libraries are available, OpenSSL-based SSLEngine implementation is selected automatically to take advantage of its performance benefit. - Examples take advantage of this modification and thus are cleaner.	2014-05-18 02:54:06 +09:00
Norman Maurer	59d92ad6cf	[#2485 ] Use RecvByteBufAllocator for all allocations related to read from Channel Motivation: At the moment we sometimes use only RecvByteBufAllocator.guess() to guess the next size and the use the ByteBufAllocator.* directly to allocate the buffer. We should always use RecvByteBufAllocator.allocate(...) all the time as this makes the behavior easier to adjust. Modifications: Change the read() implementations to make use of RecvByteBufAllocator. Result: Behavior is more consistent.	2014-05-10 15:27:43 +02:00
Norman Maurer	c5e4c2294e	[#2469 ] Correctly update indices after incomplete gathering write Motivation: When doing a gathering write we need to update the indices after the write partial completes. In the current code-base we use the wrong value when compare the expected written bytes and the actual written bytes. Modifications: Use the correct value when compare. Result: Indices are updated correctly and so no corruption can happen when resume writing after data was only partial written before.	2014-05-07 07:13:39 +02:00
Trustin Lee	296db210ab	Generate the default JAR to make oss.sonatype.org happy Motivation: oss.sonatype.org refuses to promote an artifact if it doesn't have the default JAR (the JAR without classifier.) Modifications: - Generate both the default JAR and the native JAR to make oss.sonatype.org happy - Rename the profile 'release' to 'restricted-release' which reflects what it really does better - Remove the redundant <quickbuild>true</quickbuild> in all/pom.xml We specify the profile 'full' that triggers that property already in maven-release-plugin configuration. Result: oss.sonatype.org is happy. Simpler pom.xml	2014-05-03 17:32:45 +09:00
Trustin Lee	6c1af9036f	Simplify native library resolution using os-maven-plugin Motivation: So far, we used a very simple platform string such as linux64 and linux32. However, this is far from perfection because it does not include anything about the CPU architecture. Also, the current build tries to put multiple versions of .so files into a single JAR. This doesn't work very well when we have to ship for many different platforms. Think about shipping .so/.dynlib files for both Linux and Mac OS X. Modification: - Use os-maven-plugin as an extension to determine the current OS and CPU architecture reliable at build time - Use Maven classifier instead of trying to put all shared libraries into a single JAR - NativeLibraryLoader does not guess the OS and bit mode anymore and it always looks for the same location regardless of platform, because the Maven classifier does the job instead. Result: Better scalable native library deployment and retrieval	2014-05-02 04:21:47 +09:00
Trustin Lee	b9039eaa82	Synchronized between 4.1 and master again (part 2) Motivation: 4 and 5 were diverged long time ago and we recently reverted some of the early commits in master. We must make sure 4.1 and master are not very different now. Modification: Remove ChannelHandlerInvoker.writeAndFlush(...) and the related implementations. Result: 4.1 and master got closer.	2014-04-25 15:06:26 +09:00

1 2

85 Commits