netty5

Author	SHA1	Message	Date
Trustin Lee	87346d14a8	Fix the inconsistencies between performance tests in ByteBufAllocatorBenchmark Motivation: default() tests perform the test in a different way, and they must be the same as the other tests. Modification: Make sure default() tests are the same as the others Result: Easier to compare default and non-default allocators	2014-06-21 13:27:28 +09:00
Trustin Lee	760bbc7ea6	Refactor FastThreadLocal to simplify TLV management Motivation: When Netty runs in a managed environment such as web application server, Netty needs to provide an explicit way to remove the thread-local variables it created to prevent class loader leaks. FastThreadLocal uses different execution paths for storing a thread-local variable depending on the type of the current thread. It increases the complexity of thread-local removal. Modifications: - Moved FastThreadLocal and FastThreadLocalThread out of the internal package so that a user can use it. - FastThreadLocal now keeps track of all thread local variables it has initialized, and calling FastThreadLocal.removeAll() will remove all thread-local variables of the caller thread. - Added FastThreadLocal.size() for diagnostics and tests - Introduce InternalThreadLocalMap which is a mixture of hard-wired thread local variable fields and extensible indexed variables - FastThreadLocal now uses InternalThreadLocalMap to implement a thread-local variable. - Added ThreadDeathWatcher.unwatch() so that PooledByteBufAllocator tells it to stop watching when its thread-local cache has been freed by FastThreadLocal.removeAll(). - Added FastThreadLocalTest to ensure that removeAll() works - Added microbenchmark for FastThreadLocal and JDK ThreadLocal - Upgraded to JMH 0.9 Result: - A user can remove all thread-local variables Netty created, as long as he or she did not exit from the current thread. (Note that there's no way to remove a thread-local variable from outside of the thread.) - FastThreadLocal exposes more useful operations such as isSet() because we always implement a thread local variable via InternalThreadLocalMap instead of falling back to JDK ThreadLocal. - FastThreadLocalBenchmark shows that this change improves the performance of FastThreadLocal even more.	2014-06-19 21:17:46 +09:00
belliottsmith	7d37af5dfb	Introduce FastThreadLocal which uses an EnumMap and a predefined fixed set of possible thread locals Motivation: Provide a faster ThreadLocal implementation Modification: Add a "FastThreadLocal" which uses an EnumMap and a predefined fixed set of possible thread locals (all of the static instances created by netty) that is around 10-20% faster than standard ThreadLocal in my benchmarks (and can be seen having an effect in the direct PooledByteBufAllocator benchmark that uses the DEFAULT ByteBufAllocator which uses this FastThreadLocal, as opposed to normal instantiations that do not, and in the new RecyclableArrayList benchmark); Result: Improved performance	2014-06-13 11:02:16 +02:00
Norman Maurer	405d573715	[#2436] UnsafeByteBuf implementation should only invert bytes if ByteOrder differs from native ByteOrder Motivation: Our UnsafeByteBuf implementation always inverts bytes when the native ByteOrder is LITTLE_ENDIAN (this is true on Intel), even when the user calls order(ByteOrder.LITTLE_ENDIAN). This is not optimal for performance, as the user should be able to set the ByteOrder to LITTLE_ENDIAN and so write bytes without the extra inverting. Modification: - Introduce a new special SwappedByteBuf (called UnsafeDirectSwappedByteBuf) that is used by all the Unsafe*ByteBuf implementations and allows writing without inverting the bytes. - Add benchmark - Upgrade jmh to 0.8 Result: The user is able to get the max performance even on servers that have ByteOrder.LITTLE_ENDIAN as their native ByteOrder.	2014-06-05 10:59:03 +02:00
Trustin Lee	b50f91f6d0	More realistic ByteBuf allocation benchmark Motivation: Allocating a single buffer and releasing it repetitively for a benchmark will not involve the realistic execution path of the allocators. Modifications: Keep the last 8192 allocations and release them randomly. Result: We are now getting the result close to what we got with caliper.	2014-05-29 19:50:43 +09:00
Michael Nitschinger	64be9b2e4a	Upgrade JMH to 0.4.1 and make use of @Params.	2014-02-23 16:39:15 +01:00
Michael Nitschinger	6c02e19d10	Update JMH to 0.3.2	2014-02-14 13:15:49 -08:00
Michael Nitschinger	396519f559	Using SystemPropertyUtil for property parsing.	2014-01-15 18:48:33 +01:00
Michael Nitschinger	3b77a71ffd	Make JMH options modifiable through the subclassed benchmark.	2014-01-15 18:48:33 +01:00
Michael Nitschinger	78790056c7	microbench: move from Caliper to JMH	2014-01-14 14:55:35 +09:00
Trustin Lee	dba3aa2d4f	Add io.netty.noResourceLeak option to microbench	2013-06-25 11:07:14 +09:00
Prajwal Tuladhar	05850da863	enable checkstyle for test source directory and fix checkstyle errors	2013-03-30 13:18:57 +01:00
Trustin Lee	8d88acb4a7	Change ByteBufAllocator.buffer() to allocate a direct buffer only when the platform can handle a direct buffer reliably - Rename directbyDefault to preferDirect - Add a system property 'io.netty.prederDirect' to allow a user to change the preference at launch time - Merge UnpooledByteBufAllocator.DEFAULT_BY_* to DEFAULT	2013-03-05 17:55:24 +09:00
Trustin Lee	b9996908b1	Implement reference counting - Related: #1029 - Replace Freeable with ReferenceCounted - Add AbstractReferenceCounted - Add AbstractReferenceCountedByteBuf - Add AbstractDerivedByteBuf - Add EmptyByteBuf	2013-02-10 13:10:09 +09:00
Trustin Lee	03e68482bb	Remove ChannelBuf/ByteBuf.Unsafe - Fixes #826 Unsafe.isFreed(), free(), suspend/resumeIntermediaryAllocations() are not that dangerous. internalNioBuffer() and internalNioBuffers() are dangerous but it seems like nobody is using it even inside Netty. Removing those two methods also removes the necessity to keep Unsafe interface at all.	2012-12-17 17:41:21 +09:00
Trustin Lee	e37aeb38d6	Add the original copyright	2012-12-14 00:10:28 +09:00
Trustin Lee	6339feaa8f	Apply advanced JVM options to benchmarks / Fix duplicate uploads - Add common optimization options when launching a new JVM to run a benchmark - Fix a bug where a benchmark report is uploaded twice - Simplify pom.xml and move the build instruction messages to DefaultBenchmark - Print an empty line to prettify the output	2012-12-14 00:00:41 +09:00
Trustin Lee	b47fc77522	Add PooledByteBufAllocator + microbenchmark module This pull request introduces the new default ByteBufAllocator implementation based on jemalloc, with some differences: * Minimum possible buffer capacity is 16 (jemalloc: 2) * Uses binary heap with random branching (jemalloc: red-black tree) * No thread-local cache yet (jemalloc has thread-local cache) * Default page size is 8 KiB (jemalloc: 4 KiB) * Default chunk size is 16 MiB (jemalloc: 2 MiB) * Cannot allocate a buffer bigger than the chunk size (jemalloc: possible) because we don't have control over memory layout in Java. A user can work around this issue by creating a composite buffer, but it's not always a feasible option. Although 16 MiB is a pretty big default, a user's handler might need to deal with the bounded buffers when the user wants to deal with a large message. Also, to ensure the new allocator performs well enough, I wrote a microbenchmark for it and made it a dedicated Maven module. It uses Google's Caliper framework to run and publish the test result. Miscellaneous changes: * Made some ByteBuf implementations public so that those who implement a new allocator can make use of them. * Added ByteBufAllocator.compositeBuffer() and its variants. * ByteBufAllocator.ioBuffer() creates a buffer with 0 capacity.	2012-12-13 22:35:06 +09:00

18 Commits