netty5

Author	SHA1	Message	Date
Norman Maurer	6faef55aef	Cleanup buffer tests. Motivation: Some of the tests in the buffer module contained unused code. Some of the tests also used unnecessary inheritance which could be avoided to simplify code. Modifications: Cleanup the test cases. Result: Cleaner code, less cruft.	2015-10-16 20:47:32 +02:00
Norman Maurer	31ef237085	Always return a real slice even when the length is 0 Motivation: We need to always return a real slice even when the requested length is 0. This is needed as otherwise we not correctly share the reference count and so may leak a buffer if the user call release() on the returned slice and expect it to decrement the reference count of the "parent" buffer. Modifications: - Always return a real slice - Add unit test for the bug. Result: No more leak possible when a user requests a slice of length 0 of a SlicedByteBuf.	2015-10-16 20:45:54 +02:00
Norman Maurer	291674262c	Added SlicedAbstractByteBuf that can provide fast-path for _get* and _set* methods Motivation: SlicedByteBuf can be used for any ByteBuf implementations and so can not do any optimizations that could be done when AbstractByteBuf is sliced. Modifications: - Add SlicedAbstractByteBuf that can eliminate range and reference count checks for _get* and _set* methods. Result: Faster SlicedByteBuf implementations for AbstractByteBuf sub-classes.	2015-10-16 08:59:58 +02:00
Norman Maurer	b8c73a4806	Added DuplicatedAbstractByteBuf that can provide fast-path for _get* and _set* methods Motivation: DuplicatedByteBuf can be used for any ByteBuf implementations and so can not do any optimizations that could be done when AbstractByteBuf is duplicated. Modifications: - Add DuplicatedAbstractByteBuf that can eliminate range and reference count checks for _get* and _set* methods. Result: Faster DuplicatedByteBuf implementations for AbstractByteBuf sub-classes.	2015-10-16 08:43:53 +02:00
Norman Maurer	054af70fed	Minimize object allocation when calling AbstractByteBuf.toString(..., Charset) Motivation: Calling AbstractByteBuf.toString(..., Charset) is used quite frequently by users but produce a lot of GC. Modification: - Use a FastThreadLocal to store the CharBuffer that are needed for decoding. - Use internalNioBuffer(...) when possible Result: Less object creation / Less GC	2015-10-15 17:49:21 +02:00
Norman Maurer	1103379e02	Allow to disable reference count checks on every access of the ByteBuf Motiviation: Checking reference count on every access on a ByteBuf can have some big performance overhead depending on how the access pattern is. If the user is sure that there are no reference count errors on his side it should be possible to disable the check and so gain the max performance. Modification: - Add io.netty.buffer.bytebuf.checkAccessible system property which allows to disable the checks. Enabled by default. - Add microbenchmark Result: Increased performance for operations on the ByteBuf.	2015-10-15 10:19:49 +02:00
Norman Maurer	af9dc2c6a6	Optimize and minimize bound checks Motivation: We should minimize and optimize bound checks as much as possible to get the most out of performance. Modifications: - Use bitwise operations to remove branching - Remove branches when possible Result: Better performance for various operations.	2015-10-15 10:18:20 +02:00
Norman Maurer	d6a00d0642	[#4313 ] ByteBufUtil.writeUtf8 should use fast-path for WrappedByteBuf Motivation: ByteBufUtil.writeUtf8(...) / writeUsAscii(...) can use a fast-path when writing into AbstractByteBuf. We should try to unwrap WrappedByteBuf implementations so we are able to do the same on wrapped AbstractByteBuf instances. Modifications: - Try to unwrap WrappedByteBuf to use the fast-path Result: Faster writing of utf8 and usascii for WrappedByteBuf instances.	2015-10-13 11:53:31 +02:00
Norman Maurer	99b4aec46d	[#4327 ] Ensure toString() will not throw IllegalReferenceCountException Motivation: As toString() is often used while logging we need to ensure this produces no exception. Modifications: Ensure we never throw an IllegalReferenceCountException. Result: Be able to log without produce exceptions.	2015-10-10 20:12:19 +02:00
Norman Maurer	696a287736	[maven-release-plugin] prepare for next development iteration	2015-09-30 09:31:26 +02:00
Norman Maurer	fb2d562306	[maven-release-plugin] prepare release netty-4.0.32.Final	2015-09-30 09:28:40 +02:00
Norman Maurer	3de8768601	[#3789 ] Correctly reset markers for all allocations when using PooledByteBufAllocator Motivation: We need to ensure all markers are reset when doing an allocation via the PooledByteBufAllocator. This was not the always the case. Modifications: Move all logic that needs to get executed when reuse a PooledByteBuf into one place and call it. Result: Correct behavior	2015-09-25 19:57:17 +02:00
Norman Maurer	bd928eaa38	[maven-release-plugin] prepare for next development iteration	2015-09-02 08:58:54 +02:00
Norman Maurer	26bbcc38c2	[maven-release-plugin] prepare release netty-4.0.31.Final	2015-09-02 08:57:57 +02:00
Matteo Merli	fd70dd658e	Added debug logging with effective value for io.netty.leakDetection.acquireAndReleaseOnly property Motivation: The configurable property value recently added was not logged like others properties. Modifications: Added debug log with effective value applied. Result: Consistent with other properties	2015-09-01 09:10:14 +02:00
Matteo Merli	2d4a8a75bb	Additional configuration for leak detection Motivation: Leak detector, when it detects a leak, will print the last 5 stack traces that touched the ByteBuf. In some cases that might not be enough to identify the root cause of the leak. Also, sometimes users might not be interested in tracing all the operations on the buffer, but just the ones that are affecting the reference count. Modifications: Added command line properties to override default values: * Allow to configure max number of stack traces to collect * Allow to only record retain/release operation on buffers Result: Users can increase the number of stack traces to debug buffer leaks with lot of retain/release operations.	2015-08-30 20:38:35 +02:00
Scott Mitchell	09dff41343	Restore derived buffer index/mark updates Motivation: As part of the revert process in https://github.com/netty/netty/pull/4138 some index and mark updates were lost. Modifications: - Restore the index / mark updates made in https://github.com/netty/netty/pull/3788 Result: Slice and Duplicate buffers index / marks are correctly initialized.	2015-08-27 10:24:48 -07:00
Scott Mitchell	33001e84ff	Revert "Add PooledSlicedByteBuf and PooledDuplicatedByteBuf" Motivation: Currently the "derived" buffer will only ever be recycled if the release call is made on the "derived" object, and the "wrapped" buffer ends up being "fully released" (aka refcount goes to 0). From my experience this is not the common use case and thus the "derived" buffers will not be recycled. Modifications: - revert https://github.com/netty/netty/pull/3788 Result: Less complexity, and less code to create new objects in majority of cases.	2015-08-26 13:23:42 -07:00
Matteo Merli	53f9438aec	MemoryRegionCache$Entry objects are not recycled Motivation: Even though MemoryRegionCache$Entry instances are allocated through a recycler they are not properly recycled, leaving a lot of instances to be GCed along with Recycler$DefaultHandle objects. Fixes #4071 Modification: Recycle Entry when done using it. Result: Less GCed objects.	2015-08-10 21:28:53 +02:00
Norman Maurer	148692705c	[maven-release-plugin] prepare for next development iteration	2015-07-24 10:11:44 +02:00
Norman Maurer	11cc2d5197	[maven-release-plugin] prepare release netty-4.0.30.Final	2015-07-24 09:54:20 +02:00
Norman Maurer	0a0292476e	[#3896 ] Unpooled.copiedBuffer(ByteBuffer) and copiedBuffer(ByteBuffer...) is not thread-safe. Motivation: As we modify the position of the passed in ByteBuffer's this methods are not thread-safe. Modifications: Duplicate the input ByteBuffers before copy the content to byte[]. Result: Unpooled.copiedBuffer(ByteBuffer) and copiedBuffer(ByteBuffer...) is now thread-safe.	2015-07-07 08:38:07 +02:00
Norman Maurer	75bb7882bf	[#3899 ] Fix javadoc to use netty 4 API. Motivation: The javadoc of ByteBuf contained some out-dated code. Modifications: Update code example in javadoc to use netty 4+ API Result: Correct javadocs	2015-07-03 14:18:27 +02:00
Louis Ryan	bb0b86ce50	Fix FixedCompositeByteBuf handling when copying to direct buffers and streams Motivation: FixedCompositeByteBuf does not properly implement a number of methods for copying its content to direct buffers and output streams Modifications: Replace improper use of capacity() with readableBytes() when computing offesets during writes Result: Copying works correctly	2015-06-27 21:21:36 +02:00
Norman Maurer	24de49cb82	Add FixedCompositeByteBuf which can be used to write an array of ByteBuf in an efficient way. This implementation does not produce as much GC pressure as CompositeByteBuf and so is prefered, for writing an array of ByteBufs. Be aware that FixedCompositeByteBuf is readonly. When using this in a project that make heavy use of CompositeByteBuf for writes we was able to cut down allocation to a half.	2015-06-27 20:57:24 +02:00
Norman Maurer	1da998bc7c	[maven-release-plugin] prepare for next development iteration	2015-06-23 11:08:27 +02:00
Norman Maurer	4c482c1215	[maven-release-plugin] prepare release netty-4.0.29.Final	2015-06-23 11:07:56 +02:00
Norman Maurer	1796cfc419	[#3888 ] Use 2 * cores as default minimum for pool arenas. Motivation: At the moment we use 1 * cores as default mimimum for pool arenas. This can easily lead to conditions as we use 2 * cores as default for EventLoop's when using NIO or EPOLL. If we choose a smaller number we will run into hotspots as allocation and deallocation needs to be synchronized on the PoolArena. Modifications: Change the default number of arenas to 2 * cores. Result: Less conditions when using the default settings.	2015-06-18 07:19:15 +02:00
Norman Maurer	9a30b6860f	[#3798 ] Extract dump method to ByteBufUtil Motivation: Dumping the content of a ByteBuf in a hex format is very useful. Modifications: Move code into ByteBufUtil so its easy to reuse. Result: Easy to reuse dumping code.	2015-06-12 14:05:32 +02:00
Norman Maurer	528415bc29	Update javadocs to highlight that derived buffers will not increment the reference count. Motivation: We not explain the derived buffers will not retain the parent buffer. Modifications: Add docs. Result: Correctly document behaviour	2015-06-06 19:54:03 +02:00
Norman Maurer	7e328fd1c3	Fix regression introduced by `f765053ae7` by use Entry after it is recycled	2015-05-27 16:56:05 +02:00
Norman Maurer	f765053ae7	Let PoolThreadCache work even if allocation and deallocation Thread are different Motivation: PoolThreadCache did only cache allocations if the allocation and deallocation Thread were the same. This is not optimal as often people write from differen thread then the actual EventLoop thread. Modification: - Add MpscArrayQueue which was forked from jctools and lightly modified. - Use MpscArrayQueue for caches and always add buffer back to the cache that belongs to the allocation thread. Result: ThreadPoolCache is now also usable and so gives performance improvements when allocation and deallocation thread are different. Performance when using same thread for allocation and deallocation is noticable worse then before.	2015-05-27 14:35:22 +02:00
Norman Maurer	f18990a8a5	[#3654 ] Synchronize on PoolSubpage head when allocate / free PoolSubpages Motivation: Currently we hold a lock on the PoolArena when we allocate / free PoolSubpages, which is wasteful as this also affects "normal" allocations. The same is true vice-verse. Modifications: Ensure we synchronize on the head of the PoolSubPages pool. This is done per size and so it is possible to concurrently allocate / deallocate PoolSubPages with different sizes, and also normal allocations. Result: Less condition and so faster allocation/deallocation. Before this commit: xxx:~/wrk $ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 17.61ms 29.52ms 689.73ms 97.27% Req/Sec 278.93k 41.97k 351.04k 84.83% 530527460 requests in 2.00m, 71.64GB read Requests/sec: 4422226.13 Transfer/sec: 611.52MB After this commit: xxx:~/wrk $ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 15.85ms 24.50ms 681.61ms 97.42% Req/Sec 287.14k 38.39k 360.33k 85.88% 547902773 requests in 2.00m, 73.99GB read Requests/sec: 4567066.11 Transfer/sec: 631.55MB This is reproducable every time.	2015-05-27 10:32:52 +02:00
Norman Maurer	7e80e1bf97	[#3654 ] No need to hold lock while destroy a chunk Motiviation: At the moment we sometimes hold the lock on the PoolArena during destroy a PoolChunk. This is not needed. Modification: - Ensure we not hold the lock during destroy a PoolChunk - Move all synchronized usage in PoolArena - Cleanup Result: Less condition.	2015-05-27 09:47:34 +02:00
Norman Maurer	8fed94eee3	Expose metrics for PooledByteBufAllocator Motivation: The PooledByteBufAllocator is more or less a black-box atm. We need to expose some metrics to allow the user to get a better idea how to tune it. Modifications: - Expose different metrics via PooledByteBufAllocator - Add *Metrics interfaces Result: It is now easy to gather metrics and detail about the PooledByteBufAllocator and so get a better understanding about resource-usage etc.	2015-05-20 21:01:37 +02:00
Norman Maurer	8a1a0b2c1e	Add PooledSlicedByteBuf and PooledDuplicatedByteBuf Motivation: At the moment when calling slice(...) or duplicate(...) on a Pooled*ByteBuf a new SlicedByteBuf or DuplicatedByteBuf. This can create a lot of GC. Modifications: Add PooledSlicedByteBuf and PooledDuplicatedByteBuf which will be used when a PooledByteBuf is used. Result: Less GC.	2015-05-20 11:37:41 +02:00
Norman Maurer	981292ffc0	Clarify ByteBuf.duplicate() semantics. Motivation: From the javadocs of ByteBuf.duplicate() it is not clear if the reader and writer marks will be duplicated. Modifications: Add sentence to clarify that marks will not be duplicated. Result: Clear semantics.	2015-05-20 08:32:43 +02:00
Norman Maurer	979fdfd3d3	Reset markers when obtain PooledByteBuf. Motivation: When allocate a PooledByteBuf we need to ensure to also reset the markers for the readerIndex and writerIndex. Modifications: - Correct reset the markers - Add test-case for it Result: Correctly reset markers.	2015-05-20 07:29:21 +02:00
Norman Maurer	dc73c15df3	No need to release lock and acquire again when allocate normal size. Motiviation: When tried to allocate tiny and small sized and failed to serve these out of the PoolSubPage we exit the synchronization block just to enter it again when call allocateNormal(...). Modification: Not exit the synchronized block until allocateNormal(...) is done. Result: Better performance.	2015-05-18 10:23:29 +02:00
Norman Maurer	d1c46ca987	[maven-release-plugin] prepare for next development iteration	2015-05-07 11:33:47 -04:00
Norman Maurer	005d4a42fc	[maven-release-plugin] prepare release netty-4.0.28.Final	2015-05-07 11:33:09 -04:00
Norman Maurer	0f4d3a981e	Revert "[maven-release-plugin] prepare for next development iteration" This reverts commit `3c10ffab5e`.	2015-05-07 11:02:03 -04:00
Norman Maurer	3c10ffab5e	[maven-release-plugin] prepare for next development iteration	2015-05-07 09:09:23 -04:00
Alwayswithme	c727c16707	ByteBufUtil use IndexOfProcessor to find occurrence. Motivation: The way of firstIndexOf and lastIndexOf iterating the ByteBuf is similar to forEachByte and forEachByteDesc, but have many range checks. Modifications: Use forEachByte and a IndexOfProcessor to find occurrence. Result: eliminate range checks	2015-05-07 09:50:07 +02:00
Norman Maurer	f242bc5a1a	[#3623 ] CompositeByteBuf.iterator() should return optimized Iterable Motivation: CompositeByteBuf.iterator() currently creates a new ArrayList and fill it with the ByteBufs, which is more expensive then it needs to be. Modifications: - Use special Iterator implementation Result: Less overhead when calling iterator()	2015-04-20 10:45:28 +02:00
garywu	8843d2b1a1	[#2925 ] Bug fix for NormalMemoryRegionCache overbooked for PoolThreadCache Motivation: When create NormalMemoryRegionCache for PoolThreadCache, we overbooked cache array size. This means unnecessary overhead for thread local cache as we will create multi cache enties for each element in cache array. Modifications: change: int arraySize = Math.max(1, max / area.pageSize); to: int arraySize = Math.max(1, log2(max / area.pageSize) + 1); Result: Now arraySize won't introduce unnecessary overhead. Changes to be committed: modified: buffer/src/main/java/io/netty/buffer/PoolThreadCache.java	2015-04-13 09:00:51 +02:00
Norman Maurer	bfb6189f77	Let CompositeByteBuf implement Iterable Motivation: CompositeByteBuf has an iterator() method but fails to implement Iterable Modifications: Let CompositeByteBuf implement Iterable<ByteBuf> Result: Easier usage	2015-04-12 13:38:39 +02:00
Norman Maurer	e4900cbdd3	Revert "Dereference when calling PooledByteBuf.deallocate()" This reverts commit `ddd19f4f21`.	2015-04-11 06:44:10 +02:00
Norman Maurer	ddd19f4f21	Dereference when calling PooledByteBuf.deallocate() Motivation: We missed to dereference the chunk and tmpNioBuf when calling deallocate(). This means the GC can not collect these as we still hold a reference while have the PooledByteBuf in the recycler stack. Modifications: Dereference chunk and tmpNioBuf. Result: GC can collect things.	2015-04-10 21:46:43 +02:00
Norman Maurer	30f60ac68a	Change PoolThreadCache to use LIFO for better cache performance Motiviation: At the moment we use FIFO for the PoolThreadCache which is sub-optimal as this may reduce the changes to have the cached memory actual still in the cpu-cache. Modification: - Change to use LIFO as this increase the chance to be able to serve buffers from the cpu-cache Results: Faster allocation out of the ThreadLocal cache. Before the commit: [xxx wrk]$ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 14.69ms 10.06ms 131.43ms 80.10% Req/Sec 283.89k 40.37k 433.69k 66.81% 533859742 requests in 2.00m, 72.09GB read Requests/sec: 4449510.51 Transfer/sec: 615.29MB After the commit: [xxx wrk]$ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 16.38ms 26.32ms 734.06ms 97.38% Req/Sec 283.86k 39.31k 361.69k 83.38% 540836511 requests in 2.00m, 73.04GB read Requests/sec: 4508150.18 Transfer/sec: 623.40MB	2015-04-10 20:57:31 +02:00
Norman Maurer	f2fedbcdef	[maven-release-plugin] prepare for next development iteration	2015-03-31 22:06:30 -04:00
Norman Maurer	054e7c5d17	[maven-release-plugin] prepare release netty-4.0.27.Final	2015-03-31 22:05:43 -04:00
Leo Gomes	2cc7466266	Updates the javadoc of Unpooled to remove mention to methods it does not provide Motivation: `Unpooled` javadoc's mentioned the generation of hex dump and swapping an integer's byte order, which are actually provided by `ByteBufUtil`. Modifications: Sentence moved to `ByteBufUtil` javadoc. Result: `Unpooled` javadoc is correct.	2015-03-04 12:03:50 +09:00
Norman Maurer	37264bb72b	[maven-release-plugin] prepare for next development iteration	2015-03-02 01:31:30 -05:00
Norman Maurer	0dbc96cffd	[maven-release-plugin] prepare release netty-4.0.26.Final	2015-03-02 01:30:58 -05:00
Norman Maurer	e99d89c04d	[maven-release-plugin] rollback the release of netty-4.0.26.Final	2015-02-28 21:28:06 +01:00
Norman Maurer	b86e2e6ac0	[maven-release-plugin] prepare release netty-4.0.26.Final	2015-02-28 13:55:01 -05:00
Norman Maurer	5f71b6dc54	Ensure CompositeByteBuf.addComponent* handles buffer in consistent way and not causes leaks Motivation: At the moment we have two problems: - CompositeByteBuf.addComponent(...) will not add the supplied buffer to the CompositeByteBuf if its empty, which means it will not be released on CompositeByteBuf.release() call. This is a problem as a user will expect everything added will be released (the user not know we not added it). - CompositeByteBuf.addComponents(...) will either add no buffers if none is readable and so has the same problem as addComponent(...) or directly release the ByteBuf if at least one ByteBuf is readable. Again this gives inconsistent handling and may lead to memory leaks. Modifications: - Always add the buffer to the CompositeByteBuf and so release it on release call. Result: Consistent handling and no buffer leaks.	2015-02-12 16:09:24 +01:00
Trustin Lee	0e61aeb849	[maven-release-plugin] prepare for next development iteration	2014-12-31 20:58:44 +09:00
Trustin Lee	087db82e78	[maven-release-plugin] prepare release netty-4.0.25.Final	2014-12-31 20:58:33 +09:00
Trustin Lee	0b8f47da04	Implement internal memory access methods of CompositeByteBuf correctly Motivation: When a CompositeByteBuf is empty (i.e. has no component), its internal memory access operations do not always behave as expected. Modifications: Check if the nunmber of components is zero. If so, return an empty array or an empty NIO buffer, etc. Result: More robustness	2014-12-30 15:52:57 +09:00
Trustin Lee	38fae09f23	Add more tests to EmptyByteBufTest - Ensure an EmptyByteBuf has an array, an NIO buffer, and a memory address at the same time - Add an assertion that checks if EMPTY_BUFFER is an EmptyByteBuf, just in case we make a mistake in the future	2014-12-30 15:50:49 +09:00
Norman Maurer	61a5e60513	Provide helper methods in ByteBufUtil to write UTF-8/ASCII CharSequences. Related to [#909 ] Motivation: We expose no methods in ByteBuf to directly write a CharSequence into it. This leads to have the user either convert the CharSequence first to a byte array or use CharsetEncoder. Both cases have some overheads and we can do a lot better for well known Charsets like UTF-8 and ASCII. Modifications: Add ByteBufUtil.writeAscii(...) and ByteBufUtil.writeUtf8(...) which can do the task in an optimized way. This is especially true if the passed in ByteBuf extends AbstractByteBuf which is true for all of our implementations which not wrap another ByteBuf. Result: Writing an ASCII and UTF-8 CharSequence into a AbstractByteBuf is a lot faster then what the user could do by himself as we can make use of some package private methods and so eliminate reference and range checks. When the Charseq is not ASCII or UTF-8 we can still do a very good job and are on par in most of the cases with what the user would do. The following benchmark shows the improvements: Result: 2456866.966 ?(99.9%) 59066.370 ops/s [Average] Statistics: (min, avg, max) = (2297025.189, 2456866.966, 2586003.225), stdev = 78851.914 Confidence interval (99.9%): [2397800.596, 2515933.336] Benchmark Mode Samples Score Score error Units i.n.m.b.ByteBufUtilBenchmark.writeAscii thrpt 50 9398165.238 131503.098 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiString thrpt 50 9695177.968 176684.821 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiStringViaArray thrpt 50 4788597.415 83181.549 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiStringViaArrayWrapped thrpt 50 4722297.435 98984.491 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiStringWrapped thrpt 50 4028689.762 66192.505 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiViaArray thrpt 50 3234841.565 91308.009 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiViaArrayWrapped thrpt 50 3311387.474 39018.933 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiWrapped thrpt 50 3379764.250 66735.415 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8 thrpt 50 5671116.821 101760.081 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8String thrpt 50 5682733.440 111874.084 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8StringViaArray thrpt 50 3564548.995 55709.512 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8StringViaArrayWrapped thrpt 50 3621053.671 47632.820 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8StringWrapped thrpt 50 2634029.071 52304.876 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8ViaArray thrpt 50 3397049.332 57784.119 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8ViaArrayWrapped thrpt 50 3318685.262 35869.562 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8Wrapped thrpt 50 2473791.249 46423.114 ops/s Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 1,387.417 sec - in io.netty.microbench.buffer.ByteBufUtilBenchmark Results : Tests run: 1, Failures: 0, Errors: 0, Skipped: 0 Results : Tests run: 1, Failures: 0, Errors: 0, Skipped: 0 The ViaArray benchmarks are basically doing a toString().getBytes(Charset) which the others are using ByteBufUtil.write*(...).	2014-12-26 15:57:59 +09:00
Norman Maurer	93237d435f	CompositeByteBuf.nioBuffers(...) must not return an empty ByteBuffer array Motivation: CompositeByteBuf.nioBuffers(...) returns an empty ByteBuffer array if the specified length is 0. This is not consistent with other ByteBuf implementations which return an ByteBuffer array of size 1 with an empty ByteBuffer included. Modifications: Make CompositeByteBuf.nioBuffers(...) consistent with other ByteBuf implementations. Result: Consistent and correct behaviour of nioBufffers(...)	2014-12-22 11:17:53 +01:00
Norman Maurer	fa079baf29	Always return SliceByteBuf on slice(...) to eliminate possible leak Motivation: When calling slice(...) on a ByteBuf the returned ByteBuf should be the slice of a ByteBuf and shares it's reference count. This is important as it is perfect legal to use buf.slice(...).release() and have both, the slice and the original ByteBuf released. At the moment this is only the case if the requested slice size is > 0. This makes the behavior inconsistent and so may lead to a memory leak. Modifications: - Never return Unpooled.EMPTY_BUFFER when calling slice(...). - Adding test case for buffer.slice(...).release() and buffer.duplicate(...).release() Result: Consistent behaviour and so no more leaks possible.	2014-12-22 11:15:28 +01:00
Norman Maurer	d62932c7de	Ensure buffer is not released when call array() / memoryAddress() Motivation: Before we missed to check if a buffer was released before we return the backing byte array or memoryaddress. This could lead to JVM crashes when someone tried various bulk operations on the UnsafeByteBuf implementations. Modifications: Always check if the buffer is released before all to return the byte array and memoryaddress. Result: No more JVM crashes because of released buffers when doing bulk operations on UnsafeByteBuf implementations.	2014-12-11 11:30:10 +01:00
Idel Pivnitskiy	3d200085a4	Small performance improvements Motivation: Found performance issues via FindBugs and PMD. Modifications: - Removed unnecessary boxing/unboxing operations in DefaultTextHeaders.convertToInt(CharSequence) and DefaultTextHeaders.convertToLong(CharSequence). A boxed primitive is created from a string, just to extract the unboxed primitive value. - Added a static modifier for DefaultHttp2Connection.ParentChangedEvent class. This class is an inner class, but does not use its embedded reference to the object which created it. This reference makes the instances of the class larger, and may keep the reference to the creator object alive longer than necessary. - Added a static compiled Pattern to avoid compile it each time it is used when we need to replace some part of authority. - Improved using of StringBuilders. Result: Performance improvements.	2014-11-20 00:58:35 -05:00
Norman Maurer	1914b77c71	[maven-release-plugin] prepare for next development iteration	2014-10-29 11:48:40 +01:00
Norman Maurer	c170e7df3f	[maven-release-plugin] prepare release netty-4.0.24.Final	2014-10-29 11:47:19 +01:00
Norman Maurer	cb85ed9d66	Disable caching of PooledByteBuf for different threads. Motivation: We introduced a PoolThreadCache which is used in our PooledByteBufAllocator to reduce the synchronization overhead on PoolArenas when allocate / deallocate PooledByteBuf instances. This cache is used for both the allocation path and deallocation path by: - Look for cached memory in the PoolThreadCache for the Thread that tries to allocate a new PooledByteBuf and if one is found return it. - Add the memory that is used by a PooledByteBuf to the PoolThreadCache of the Thread that release the PooledByteBuf This works out very well when all allocation / deallocation is done in the EventLoop as the EventLoop will be used for read and write. On the otherside this can lead to surprising side-effects if the user allocate from outside the EventLoop and and pass the ByteBuf over for writing. The problem here is that the memory will be added to the PoolThreadCache that did the actual write on the underlying transport and not on the Thread that previously allocated the buffer. Modifications: Don't cache if different Threads are used for allocating/deallocating Result: Less confusing behavior for users that allocate PooledByteBufs from outside the EventLoop.	2014-09-22 13:38:35 +02:00
Norman Maurer	687d3d3b5c	[#2924 ] Correctly update head in MemoryRegionCache.trim() Motivation: When MemoryRegionCache.trim() is called, some unused cache entries will be freed (started from head). However, in MeoryRegionCache.trim() the head is not updated, which make entry list's head point to an entry whose chunk is null now and following allocate of MeoryRegionCache will return false immediately. In other word, cache is no longer usable once trim happen. Modifications: Update head to correct idx after free entries in trim(). Result: MemoryRegionCache behaves correctly even after calling trim().	2014-09-22 10:56:17 +02:00
Norman Maurer	0bab6116f3	[#2843 ] Add test-case to show correct behavior of ByteBuf.refCnt() and ByteBuf.release(...) Motivation: We received a bug-report that the ByteBuf.refCnt() does sometimes not show the correct value when release() and refCnt() is called from different Threads. Modifications: Add test-case which shows that all is working like expected Result: Test-case added which shows everything is ok.	2014-09-01 08:48:15 +02:00
Trustin Lee	7710e7da44	[maven-release-plugin] prepare for next development iteration	2014-08-16 03:02:02 +09:00
Trustin Lee	208198c0cb	[maven-release-plugin] prepare release netty-4.0.23.Final	2014-08-16 03:01:57 +09:00
Trustin Lee	a2d508711d	[maven-release-plugin] prepare for next development iteration	2014-08-14 09:41:33 +09:00
Trustin Lee	3051db9d59	[maven-release-plugin] prepare release netty-4.0.22.Final	2014-08-14 09:41:28 +09:00
Trustin Lee	2b5aa716ba	Use heap buffers for Unpooled.copiedBuffer() Related issue: #2028 Motivation: Some copiedBuffer() methods in Unpooled allocated a direct buffer. An allocation of a direct buffer is an expensive operation, and thus should be avoided for unpooled buffers. Modifications: - Use heap buffers in all copiedBuffer() methods Result: Unpooled.copiedBuffers() are less expensive now.	2014-08-13 15:10:00 -07:00
Trustin Lee	fb5583d788	Refactoring in preparation to unify I/O logic for all branches Motivation: While trying to merge our ChannelOutboundBuffer changes we've made last week, I realized that we have quite a bit of conflicting changes at 4.1 and master. It was primarily because we added ChannelOutboundBuffer.beforeAdd() and moved some logic there, such as direct buffer conversion. However, this is not possible with the changes we've made for 4.0. We made ChannelOutboundBuffer final for example. Maintaining multiple branch is already getting painful and having different core will make it even worse, so I think we should keep the differences between 4.0 and other branches minimal. Modifications: - Move ChannelOutboundBuffer.safeRelease() to ReferenceCountUtil - Add ByteBufUtil.threadLocalBuffer() - Backported from ThreadLocalPooledDirectByteBuf - Make most methods in AbstractUnsafe final - Add AbstractChannel.filterOutboundMessage() so that a transport can convert a message to another (e.g. heap -> off-heap), and also reject unsupported messages - Move all direct buffer conversions to filterOutboundMessage() - Move all type checks to filterOutboundMessage() - Move AbstractChannel.checkEOF() to OioByteStreamChannel, because it's the only place it is used at all - Remove ChannelOutboundBuffer.current(Object), because it's not used anymore - Add protected direct buffer conversion methods to AbstractNioChannel and AbstractEpollChannel so that they can be used by their subtypes - Update all transport implementations according to the changes above Result: - The missing extension point in 4.0 has been added. - AbstractChannel.filterOutboundMessage() - Thanks to the new extension point, we moved all transport-specific logic from ChannelOutboundBuffer to each transport implementation - We can copy most of the transport implementations in 4.0 to 4.1 and master now, so that we have much less merge conflict when we modify the core.	2014-08-05 08:04:23 +02:00
Trustin Lee	07801d7b38	Remove duplicate range check in AbstractByteBuf.skipBytes()	2014-07-29 15:58:38 -07:00
Idel Pivnitskiy	01b11ca2cb	Small performance improvements Modifications: - Added a static modifier for CompositeByteBuf.Component. This class is an inner class, but does not use its embedded reference to the object which created it. This reference makes the instances of the class larger, and may keep the reference to the creator object alive longer than necessary. A boxed primitive is created from a String, just to extract the unboxed primitive value. - Removed unnecessary checks if file exists before call mkdirs() in NativeLibraryLoader and PlatformDependent. Because the method mkdirs() has this check inside. Conflicts: codec-http/src/main/java/io/netty/handler/codec/http/multipart/DiskAttribute.java codec-stomp/src/main/java/io/netty/handler/codec/stomp/StompSubframeAggregator.java codec-stomp/src/main/java/io/netty/handler/codec/stomp/StompSubframeDecoder.java	2014-07-20 09:29:33 +02:00
Norman Maurer	c4d0a87e19	[#2653 ] Remove unnecessary ensureAccessible() calls Motivation: I introduced ensureAccessible() class as part of `6c47cc9711` in some places. Unfortunally I also added some where these are not needed and so caused a performance regression. Modification: Remove calls where not needed. Result: Fixed performance regression.	2014-07-14 21:02:07 +02:00
Norman Maurer	ccd88596f3	[#2653 ] Remove uncessary range checks for performance reasons Motivation: I introduced range checks as part of `6c47cc9711` in some places. Unfortunally I also added some where these are not needed and so caused a performance regression. Modification: Remove range checks where not needed Result: Fixed performance regression.	2014-07-14 11:40:27 +02:00
Brendt Lucas	5061be1b03	[#2642 ] CompositeByteBuf.deallocate memory/GC improvement Motivation: CompositeByteBuf.deallocate generates unnecessary GC pressure when using the 'foreach' loop, as a 'foreach' loop creates an iterator when looping. Modification: Convert 'foreach' loop into regular 'for' loop. Result: Less GC pressure (and possibly more throughput) as the 'for' loop does not create an iterator	2014-07-08 21:08:34 +02:00
Trustin Lee	3d8d843198	Fix the build timeout when 'leak' profile is active Motivation: AbstractByteBufTest.testInternalBuffer() uses writeByte() operations to populate the sample data. Usually, this isn't a problem, but it starts to take a lot of time when the resource leak detection level gets higher. In our CI machine, testInternalBuffer() takes more than 30 minutes, causing the build timeout when the 'leak' profile is active (paranoid level resource detection.) Modification: Populate the sample data using ThreadLocalRandom.nextBytes() instead of using millions of writeByte() operations. Result: Test runs much faster when leak detection level is high.	2014-07-03 17:55:18 +09:00
Trustin Lee	0a8ff3b52d	Fix most inspector warnings Motivation: It's good to minimize potentially broken windows. Modifications: Fix most inspector warnings from our profile Update IntObjectHashMap Result: Cleaner code	2014-07-02 20:21:30 +09:00
Norman Maurer	e8f4def2a3	[maven-release-plugin] prepare for next development iteration	2014-06-30 14:31:08 +02:00
Norman Maurer	25e3c8ce3d	[maven-release-plugin] prepare release netty-4.0.21.Final	2014-06-30 14:29:15 +02:00
Norman Maurer	6c47cc9711	[#2622 ] Correctly check reference count before try to work on the underlying memory Motivation: Because of how we use reference counting we need to check for the reference count before each operation that touches the underlying memory. This is especially true as we use sun.misc.Cleaner.clean() to release the memory ASAP when possible. Because of this the user may cause a SEGFAULT if an operation is called that tries to access the backing memory after it was released. Modification: Correctly check the reference count on all methods that access the underlying memory or expose it via a ByteBuffer. Result: Safer usage of ByteBuf	2014-06-30 07:10:12 +02:00
Trustin Lee	d8d0bbfc26	Optimize PoolChunk - Using short[] for memoryMap did not improve performance. Reverting back to the original dual-byte[] structure in favor of simplicity. - Optimize allocateRun() which yields small performence improvement - Use local variable when member fields are accessed more than once	2014-06-26 17:06:29 +09:00
Trustin Lee	c41538050c	Fix inspector warnings	2014-06-26 17:06:29 +09:00
Pavan Kumar	d2e36a49c7	Improve the allocation algorithm in PoolChunk Motivation: Depth-first search is not always efficient for buddy allocation. Modification: Employ a new faster search algorithm with different memoryMap layout. Result: With thread-local cache disabled, we see a lot of performance improvment, especially when the size of the allocation is as small as the page size, which had the largest search space previously.	2014-06-26 17:06:29 +09:00
Trustin Lee	4a13f66e13	Remove 'get' prefix from all HTTP/SPDY messages Motivation: Persuit for the consistency in method naming Modifications: - Remove the 'get' prefix from all HTTP/SPDY message classes - Fix some inspector warnings Result: Consistency Fixes #2594	2014-06-24 18:33:30 +09:00
Norman Maurer	22f16e52bf	MessageToByteEncoder always starts with ByteBuf that use initalCapacity == 0 Motivation: MessageToByteEncoder always starts with ByteBuf that use initalCapacity == 0 when preferDirect is used. This is really wasteful in terms of performance as every first write into the buffer will cause an expand of the buffer itself. Modifications: - Change ByteBufAllocator.ioBuffer() use the same default initialCapacity as heapBuffer() and directBuffer() - Add new allocateBuffer method to MessageToByteEncoder that allow the user to do some smarter allocation based on the message that will be encoded. Result: Less expanding of buffer and more flexibilty when allocate the buffer for encoding.	2014-06-24 13:55:02 +09:00
Trustin Lee	7162d96ed5	Revert "Improve the allocation algorithm in PoolChunk" This reverts commit `36305d7dce`, which seems to cause an assertion failure on our CI machine.	2014-06-21 19:19:49 +09:00
Pavan Kumar	004ffbad90	Improve the allocation algorithm in PoolChunk Motivation: Depth-first search is not always efficient for buddy allocation. Modification: Employ a new faster search algorithm with different memoryMap layout. Result: With thread-local cache disabled, we see a lot of performance improvment, especially when the size of the allocation is as small as the page size, which had the largest search space previously: -- master head -- Benchmark (size) Mode Score Error Units pooledDirectAllocAndFree 8192 thrpt 215.392 1.565 ops/ms pooledDirectAllocAndFree 16384 thrpt 594.625 2.154 ops/ms pooledDirectAllocAndFree 65536 thrpt 1221.520 18.965 ops/ms pooledHeapAllocAndFree 8192 thrpt 217.175 1.653 ops/ms pooledHeapAllocAndFree 16384 thrpt 587.250 14.827 ops/ms pooledHeapAllocAndFree 65536 thrpt 1217.023 44.963 ops/ms -- changes -- Benchmark (size) Mode Score Error Units pooledDirectAllocAndFree 8192 thrpt 3656.744 94.093 ops/ms pooledDirectAllocAndFree 16384 thrpt 4087.152 22.921 ops/ms pooledDirectAllocAndFree 65536 thrpt 4058.814 29.276 ops/ms pooledHeapAllocAndFree 8192 thrpt 3640.355 44.418 ops/ms pooledHeapAllocAndFree 16384 thrpt 4030.206 24.365 ops/ms pooledHeapAllocAndFree 65536 thrpt 4103.991 70.991 ops/ms	2014-06-21 13:20:56 +09:00
Norman Maurer	19a1b603d0	Remove System.out.println(...) debug messages	2014-06-20 19:42:08 +02:00
Norman Maurer	d2b8560a76	[#2580 ] [#2587 ] Fix buffer corruption regression when ByteBuf.order(LITTLE_ENDIAN) is used Motivation: To improve the speed of ByteBuf with order LITTLE_ENDIAN and where the native order is also LITTLE_ENDIAN (intel) we introduces a new special SwappedByteBuf before in commit `4ad3984c8b`. Unfortunally the commit has a flaw which does not handle correctly the case when a ByteBuf expands. This was caused because the memoryAddress was cached and never changed again even if the underlying buffer expanded. This can lead to corrupt data or even to SEGFAULT the JVM if you are lucky enough. Modification: Always lookup the actual memoryAddress of the wrapped ByteBuf. Result: No more data-corruption for ByteBuf with order LITTLE_ENDIAN and no JVM crashes.	2014-06-20 18:25:54 +02:00
Trustin Lee	fb538ea532	Refactor FastThreadLocal to simplify TLV management Motivation: When Netty runs in a managed environment such as web application server, Netty needs to provide an explicit way to remove the thread-local variables it created to prevent class loader leaks. FastThreadLocal uses different execution paths for storing a thread-local variable depending on the type of the current thread. It increases the complexity of thread-local removal. Modifications: - Moved FastThreadLocal and FastThreadLocalThread out of the internal package so that a user can use it. - FastThreadLocal now keeps track of all thread local variables it has initialized, and calling FastThreadLocal.removeAll() will remove all thread-local variables of the caller thread. - Added FastThreadLocal.size() for diagnostics and tests - Introduce InternalThreadLocalMap which is a mixture of hard-wired thread local variable fields and extensible indexed variables - FastThreadLocal now uses InternalThreadLocalMap to implement a thread-local variable. - Added ThreadDeathWatcher.unwatch() so that PooledByteBufAllocator tells it to stop watching when its thread-local cache has been freed by FastThreadLocal.removeAll(). - Added FastThreadLocalTest to ensure that removeAll() works - Added microbenchmark for FastThreadLocal and JDK ThreadLocal - Upgraded to JMH 0.9 Result: - A user can remove all thread-local variables Netty created, as long as he or she did not exit from the current thread. (Note that there's no way to remove a thread-local variable from outside of the thread.) - FastThreadLocal exposes more useful operations such as isSet() because we always implement a thread local variable via InternalThreadLocalMap instead of falling back to JDK ThreadLocal. - FastThreadLocalBenchmark shows that this change improves the performance of FastThreadLocal even more.	2014-06-19 21:08:16 +09:00
Norman Maurer	6fdf1138ca	[#2573 ] UnpooledUnsafeDirectByteBuf.setBytes(int,ByteBuf,int,int) fails to use fast-path when src has array Motivation: UnpooledUnsafeDirectByteBuf.setBytes(int,ByteBuf,int,int) fails to use fast-path when src uses an array as backing storage. This is because the if else uses the wrong ByteBuf for its check. Modifications: - Use correct ByteBuf when check for array as backing storage - Also eliminate unecessary check in UnpooledDirectByteBuf which always fails anyway Result: Faster setBytes(...) when src ByteBuf is backed by an array. No more IndexOutOfBoundsException or data-corruption.	2014-06-16 11:11:54 +02:00
Norman Maurer	b737d631f1	[maven-release-plugin] prepare for next development iteration	2014-06-12 16:20:52 +02:00

1 2 3 4 5 ...

567 Commits