netty5

Author	SHA1	Message	Date
Trustin Lee	3a9f472161	Make retained derived buffers recyclable Related: #4333 #4421 #5128 Motivation: slice(), duplicate() and readSlice() currently create a non-recyclable derived buffer instance. Under heavy load, an application that creates a lot of derived buffers can put the garbage collector under pressure. Modifications: - Add the following methods which creates a non-recyclable derived buffer - retainedSlice() - retainedDuplicate() - readRetainedSlice() - Add the new recyclable derived buffer implementations, which has its own reference count value - Add ByteBufHolder.retainedDuplicate() - Add ByteBufHolder.replace(ByteBuf) so that.. - a user can replace the content of the holder in a consistent way - copy/duplicate/retainedDuplicate() can delegate the holder construction to replace(ByteBuf) - Use retainedDuplicate() and retainedSlice() wherever possible - Miscellaneous: - Rename DuplicateByteBufTest to DuplicatedByteBufTest (missing 'D') - Make ReplayingDecoderByteBuf.reject() return an exception instead of throwing it so that its callers don't need to add dummy return statement Result: Derived buffers are now recycled when created via retainedSlice() and retainedDuplicate() and derived from a pooled buffer	2016-05-17 11:16:13 +02:00
Norman Maurer	2472d8c3cf	[#5216 ] Correctly increment deallocationsHuge when call PoolArena.free Motivation: We called deallocationsHuge.decrement() but it needs to be increment() Modifications: Replace decrement() with increment() Result: Correct metrics.	2016-05-06 21:44:43 +02:00
Norman Maurer	9f5eb7d698	Add CharSequence operations to ByteBuf Motivation: Often users either need to read or write CharSequences to a ByteBuf. We should add methods for this to ByteBuf as we can do some optimizations for this depending on the implementation. Modifications: Add setCharSequence, writeCharSequence, getCharSequence and readCharSequence Result: Easier reading / writing of CharSequence with ByteBuf.	2016-05-06 16:52:00 +02:00
Carl Mastrangelo	cf07f984b1	Add @Deprecated when the javadoc says its deprecated Motivation: Reduce nag warnings when compiling, make it easier for IDEs to display what's deprecated. Modifications: Added @Deprecated in a few places Result: No more warnings.	2016-05-01 20:30:13 +02:00
Norman Maurer	d698746609	Add ByteBuf.asReadOnly() Motivation: We lately added ByteBuf.isReadOnly() which allows to detect if a buffer is read-only or not. We should add ByteBuf.asReadOnly() to allow easily access a read-only version of a buffer. Modifications: - Add ByteBuf.asReadOnly() - Deprecate Unpooled.unmodifiableBuffer(Bytebuf) Result: More consistent api.	2016-04-14 10:51:20 +02:00
Jongyeol Choi	96455a9558	Implement codec-redis Motivation: - To encode/decode RESP (REdis Serialization Protocol) using Netty - http://redis.io/topics/protocol Modifications: - Add RedisEncoder, RedisDecoder - Add RedisBulkStringAggregator and RedisArrayAggregator - Add tests Result: - Added codec-redis - codec-redis can encode/decode RESP (REdis Serialization Protocol)	2016-04-14 10:23:16 +02:00
Norman Maurer	7d3ca7fb92	Fix IndexOutOfBoundsException when FixedCompositeByteBuf is constructed with an empty array. Motivation: When FixedCompositeByteBuf was constructed with new ByteBuf[0] and IndexOutOfboundsException was thrown. Modifications: Fix constructor Result: No more exception	2016-04-14 08:07:55 +02:00
Norman Maurer	4ddb81f36f	Not cache SwappedByteBuf in AbstractByteBuf to reduce memory footprint. Motivation: We should not cache the SwappedByteBuf in AbstractByteBuf to reduce the memory footprint. Modifications: Not cache the SwappedByteBuf. Result: Less memory footprint.	2016-04-14 08:06:21 +02:00
Norman Maurer	2537880e5d	Fix typo in exception message Motivation: Typo in exception message. Modifications: Fix the typo. Result: No more typo.	2016-04-14 08:03:35 +02:00
Trustin Lee	0b078314b2	Add ByteBuf.isReadOnly() Motivation: It is sometimes useful to determins if a buffer is read-only. Modifications: Add ByteBuf.isReadOnly() Result: One more feature	2016-04-13 21:41:27 +09:00
Trustin Lee	57063b6db0	Implement the ByteBuf methods that should have been overridden Motivation: Some ByteBuf implementations do not override all necessary methods, which can lead to potentially sub-optimal behavior. Also, SlicedByteBuf does not perform the range check correctly due to missing overrides. Modifications: - Add missing overrides - Use unwrap() instead of direct member access in derived buffers for consistency - Merge unwrap0() into unwrap() using covariant return type - Deprecate AbstractDerivedByteBuf and its subtypes, because they were not meant to be public Result: Correctness	2016-04-13 21:05:57 +09:00
Norman Maurer	69070c37ba	ByteBuf.readBytes(...) should use the allocator of the buffer to create the new buffer. Related to [#5093 ] Motivation: ByteBuf.readBytes(...) uses Unpooled.buffer(...) internally which will use a heap ByteBuf and also not able to make use of the allocator which may be pooled. We should better make use of the allocator. Modifications: Use the allocator for thenew buffer. Result: Take allocator into account when copy bytes.	2016-04-09 20:28:17 +02:00
Norman Maurer	200ca39b5c	Add PooledByteBufAllocator.dumpStats() which allows to obtain a human-readable status of the allocator. Motiviation: Sometimes it is useful to dump the status of the PooledByteBufAllocator and log it. Doing this is currently a bit cumbersome as the user needs to basically iterate through all the metrics and compose the String. we would better provide an easy way to do this. Modification: Add dumpStats() method. Result: Easier to get a view into the status of the allocator.	2016-04-09 19:16:53 +02:00
Norman Maurer	cfa258e096	PoolChunkList.allocate(...) should return false without the need to walk all the contained PoolChunks when the requested capacity is too big. Motivation: PoolChunkList.allocate(...) should return false without the need to walk all the contained PoolChunks when the requested capacity is larger then the capacity that can be allocated out of the PoolChunks in respect to the minUsage() and maxUsage() of the PoolChunkList. Modifications: Precompute the maximal capacity that can be allocated out of the PoolChunks that are contained in the PoolChunkList and use this to fast return from the allocate(...) method if an allocation capacity larger then that is requested. Result: Faster detection of allocations that can not be handled by the PoolChunkList and so faster allocations in general via the PoolArena.	2016-04-09 19:00:20 +02:00
Norman Maurer	2ae97946dd	Allow to retrieve the number of active bytes per PoolArena. Motivation: To better understand how much memory is used by Netty for ByteBufs it is useful to understand how many bytes are currently active (allocated) per PoolArena. Modifications: - Add PoolArenaMetric.numActiveBytes() Result: The user is able to get better insight into the PooledByteBufAllocator.	2016-04-09 18:54:05 +02:00
Norman Maurer	787a55874b	Cleanup PoolChunk and PoolArena Motivation: To make it easier to understand PoolChunk and PoolArena we should cleanup duplicated code. Modifications: - Move reused code into methods - Use Math.max(...) Result: Cleaner code and easier to understand.	2016-04-09 18:47:40 +02:00
Norman Maurer	1b4a5609d3	Not try to allocate out of the PoolChunkList that contains only full PoolChunks Motivation: When doing a normal allocation in PoolArena we also tried to allocate out of the PoolChunkList that only contains completely full PoolChunks. This makes no sense as these are full anyway so an allocation will never work here and just gives a perf hit as we need to walk the whole list of PoolChunks in the list. Modifications: Not try to allocate from PoolChunkList that only contains full PoolChunks Result: Faster allocation times when a new PoolChunk must be created.	2016-04-09 18:39:56 +02:00
Norman Maurer	7b121c26ae	Use Math.min(...) and Math.max(...) as these are intrinsics. Motivation: We should better use Math utilities as these are intrinsics. This is a cleanup for `ea3ffb8536`. Modifications: Use Math utilities. Result: Cleaner code and use of intrinsics.	2016-04-06 21:30:32 +02:00
Norman Maurer	88b093c481	PoolChunkList not correctly move PoolChunks when these are moved. Motivation: When a PoolChunk needs to get moved to the previous PoolChunkList because of the minUsage / maxUsage constraints we always just moved it one level which is incorrect and so could lead to have PoolChunks in the wrong PoolChunkList (in respect to their minUsage / maxUsage settings). This then could have the effect that PoolChunks are not released / freed in a timely fashion and so. Modifications: - Correctly move PoolChunks between PoolChunkLists, which includes moving it multiple "levels". - Add unit test Result: Correctlty move the PoolChunk to PoolChunkList when it is freed, even if its multiple layers.	2016-04-06 11:52:23 +02:00
Norman Maurer	ea3ffb8536	Fix PoolChunkList.minUsage() and maxUsage() for head and tail Motivation: The PoolChunkList.minUsage() and maxUsage() needs to take special action to translate Integer.MIN_VALUE / MAX_VALUE as these are used internal for tail and head of the linked-list structure. Modifications: - Correct the minUsage() and maxUsage() methods. - Add unit test. Result: Correct metrics	2016-04-06 10:03:28 +02:00
Norman Maurer	4e779cd6b0	Allow to disable leak detection for all buffers when using UnpooledByteBufAllocator. Motivation: Sometimes it is useful to allow to disable the leak detection of buffers if the UnpooledByteBufAllocator is used. This is for example true if the app wants to leak buffers into user code but not want to put the burden on the user to always release the buffer. Modifications: Add another constructor to UnpooledByteBufAllocator that allows to completely disable leak-detection for all buffers that are allocator out of the UnpooledByteBufAllocator. Result: It's possible to disable leak-detection when the UnpooledByteBufAllocator is used.	2016-04-05 13:17:16 +02:00
Norman Maurer	1d23e358b4	Only increment metric for huge / normal allocations after the allocation was really done. Motivation: We should only increment the metric for the huge / normal allocation after it is done. Also we should only decrement once deallocate. Modifications: - Move increment after the allocation. - Fix deallocation metric and move it after deallocation Result: More correct metrics.	2016-04-05 12:00:44 +02:00
Norman Maurer	b32e07c75a	PoolThreadCache uses wrong variable in IllegalArgumentException Motivation: PoolThreadCache includes the wrong value when throwing a IllegalArgumentException because of freeSweepAllocationThreshold. Modifications: Use the correct value. Result: Correct exception message.	2016-04-05 11:48:38 +02:00
Karas Lukáš	42419d918d	Fix setBytes when source is read-only ByteBuffer and target is pooled buffer Motivation: The method setBytes creates temporary heap buffer when source buffer is read-only. But this temporary buffer is not used correctly and may lead to data corruption. This problem occurs when target buffer is pooled and temporary buffer arrayOffset() is not zero. Modifications: Use correct arrayOffset when calling PlatformDependent.copyMemory. Unit test was added to test this case. Result: Setting buffer content works correctly when target is pooled buffer and source is read-only ByteBuffer.	2016-03-22 09:18:44 +01:00
Norman Maurer	8d499a2419	Fix calculation of PoolArena metrics after introducing a regression in `89da788fd2`	2016-03-17 10:37:22 +01:00
Norman Maurer	daa4efcfef	Add proper synchronization when access metrics. Motivation: We also need to add synchronization when access fields to ensure we see the latest updates. Modifications: Add synchronization when read fields that are written concurrently. Result: Ensure correct visibility of updated.	2016-03-17 09:05:48 +01:00
Xiaoyan Lin	01835fdf18	Add LineEncoder to append a line separator automatically Motivation: See #1811 Modifications: Add LineEncoder and LineSeparator Result: The user can use LineEncoder to write a String with a line separator automatically	2016-03-16 20:31:01 +01:00
Norman Maurer	6796604f46	Remove double spacing Motivation: We had some double spacing in the methods which should be removed to keep things consistent. Modifications: Remove redundant spaces. Result: Cleaner / consistent coding style.	2016-03-16 10:25:48 +01:00
Norman Maurer	7d12333c38	Add final keyword which was missing in `47b598e6ce` Motivation: The two fields should have final keyword. Modifications: Add final keyword Result: Cleaner code.	2016-03-16 10:25:00 +01:00
buchgr	b88a980482	Change arena to thread cache mapping algorithm to be closer to ideal. Motivation: Circular assignment of arenas to thread caches can lead to less than optimal mappings in cases where threads are (frequently) shutdown and started. Example Scenario: There are a total of 2 arenas. The first two threads performing an allocation would lead to the following mapping: Thread 0 -> Arena 0 Thread 1 -> Arena 1 Now, assume Thread 1 is shut down and another Thread 2 is started. The current circular assignment algorithm would lead to the following mapping: Thread 0 -> Arena 0 Thread 2 -> Arena 0 Ideally, we want Thread 2 to use Arena 1 though. Presumably, this is not much of an issue for most Netty applications that do all the allocations inside the eventloop, as eventloop threads are seldomly shut down and restarted. However, applications that only use the netty-buffer package or implement their own threading model outside the eventloop might suffer from increased contention. For example, gRPC Java when using the blocking stub performs some allocations outside the eventloop and within its own thread pool that is dynamically sized depending on system load. Modifications: Implement a linear scan algorithm that assigns a new thread cache to the arena that currently backs the fewest thread caches. Result: Closer to ideal mappings between thread caches and arenas. In order to always get an ideal mapping, we would have to re-balance the mapping whenever a thread dies. However, that's difficult because of deallocation.	2016-03-15 14:16:34 +01:00
buchgr	35771dd1cd	Fix race in PoolArena.allocate. Fixes #4829 Motivation: The statistic counters PoolArena.(allocationsTiny\|allocationsSmall) are not protected by a per arena lock, but by a per size class lock. Thus, two concurrent allocations of different size (class) could lead to a race and ultimately to wrong statistics. Modifications: Use a thread-safe LongCounter instead of a plain long data type. Result: Fewer data races.	2016-03-14 08:57:46 +01:00
Xiaoyan Lin	e2d4e22243	Add CharsetUtil.encoder/decoder() methods Motivation: See #3321 Modifications: 1. Add CharsetUtil.encoder/decoder() methods 2. Deprecate CharsetUtil.getEncoder/getDecoder() methods Result: Users can use new CharsetUtil.encoder/decoder() to specify error actions	2016-03-07 10:48:31 +00:00
Scott Mitchell	94f27be59b	ByteBufUtil writeUtf8 and writeAscii helper methods Motivation: Utility methods in ByteBufUtil to writeUtf8 and writeAscii expect a buffer to already be allocated. If the user does not have a buffer allocated they have to know details of the encoding in order to know the size of the buffer to allocate. Modifications: - Add writeUtf8 and writeAscii which take a ByteBufAllocator and allocate a ByteBuf of the correct size for the user Result: ByteBufUtil methods which are easier to use if the user doesn't already have a ByteBuf.	2016-02-29 10:09:06 +01:00
Norman Maurer	74495fd27f	Add advanced leakdetection for methods introduced by [#4842 ] Motivation: [#4842] introduced 4 new methods but missed to implement advanced leak detection for these. Modifications: Correctly implement advanced leak detection for these methods. Result: Advanced leak detection works for all methods as expected.	2016-02-17 09:40:42 -08:00
Xiaoyan Lin	ccb0870600	Add methods with position independent FileChannel calls to ByteBuf Motivation See ##3229 Modifications: Add methods with position independent FileChannel calls to ByteBuf and its subclasses. Results: The user can use these new methods to read/write ByteBuff without updating FileChannel's position.	2016-02-14 20:37:37 -08:00
Scott Mitchell	691bc1690e	ByteBufUtil.writeUtf8 not consistent with String.getBytes(Charset) Motivation: `f750d6e36c` added support for surrogates in the writeUtf8 conversion. However exceptions are thrown if invalid input is detected, but the JDK (and slow path of writeUtf8) uses a replacement character and does not throw. We should behave the same way. Modificiations: - Don't throw in ByteBufUtil.writeUtf8, and instead use a replacement character consistent with the JDK Result: ByteBufUtil.writeUtf8 behavior is consistent with the JDK UTF_8 conversion.	2016-02-11 08:24:26 -08:00
Norman Maurer	19907030d1	[#4841 ] Fix segfault if UnpooledUnsafeHeapByteBuf.getShort(..) is used and UNALGINED access is not possible. Motivation: We missed to take the byte[] into account when try to access the bytes and so produce a segfault. Modifications: Correctly pass the byte[] in. Result: No more segfault.	2016-02-05 09:24:32 +01:00
Scott Mitchell	6312c2f00b	CompositeByteBuf.addComponent always assume reference count ownership Motivation: The current interface for CompositeByteBuf.addComponent is not clear under what conditions ownership is transferred when addComponent is called. There should be a well defined behavior so that users can ensure that no leaks occur. Modifications: - CompositeByteBuf.addComponent should always assume reference count ownership Result: Users that call CompositeByteBuf.addComponent do not have to independently check if the buffer's ownership has been transferred and if not independently release the buffer. Fixes https://github.com/netty/netty/issues/4760	2016-02-02 11:38:11 -08:00
Norman Maurer	cdb70d31ee	[#4017 ] Implement proper resource leak detection for CompositeByteBuf Motivation: CompositeByteBuf only implemented simple resource leak detection and how it was implemented was completly different to the way it was for ByteBuf. The other problem was that slice(), duplicate() and others would not return a resource leak enabled buffer. Modifications: - Proper implementation for all level of resource leak detection for CompositeByteBuf Result: Proper resource leak detection for CompositeByteBuf.	2016-01-18 10:14:39 +01:00
Norman Maurer	45674baf3e	Correctly report LE operations when advanced leak detection is used. Motivation: We missed reporting LE operations when AdvancedLeakAwareByteBuf was used. This could lead to incomplete access reports. Modifications: Correctly record access for *LE operations. Result: Correct leak reports.	2016-01-12 19:57:02 +01:00
Norman Maurer	f41aa306e9	Remove multiple calls to recordLeakNonRefCountingOperation() in AdvancedLeakAwareByteBuf.forEachByteDesc Motivation: AdvancedLeakAwareByteBuf.forEachByteDesc(...) called recordLeakNonRefCountingOperation() two times which resulted in incorrect leak detection reports. Modifications: Remove duplicated call to recordLeakNonRefCountingOperation() Result: Correct leak detection results	2016-01-12 19:53:23 +01:00
Xiaoyan Lin	6fe0db4001	Fix IndexOutOfBoundsException for CompositeByteBuf #4679 Motivation: Modifications: Use the correct start index Result: Fixes #4679	2016-01-08 19:11:57 -08:00
Scott Mitchell	e578134b57	Unpooled and Wrapped Buffer Leak Motivation: There are a few buffer leaks related to how Unpooled.wrapped and Base64.encode is used. Modifications: - Fix usages of Bas64.encode to correct leaks - Clarify interface of Unpooled.wrapped* to ensure reference count ownership is clearly defined. Result: Reference count code is more clearly defined and less leaks are possible.	2016-01-07 12:02:52 -08:00
Sergey Polovko	0efb679585	replace ends -> starts	2015-12-28 08:03:22 +01:00
Xiaoyan Lin	475d901131	Fix errors reported by javadoc Motivation: Javadoc reports errors about invalid docs. Modifications: Fix some errors reported by javadoc. Result: A lot of javadoc errors are fixed by this patch.	2015-12-27 08:36:45 +01:00
Xiaoyan Lin	a96d52fe66	Fix javadoc links and tags Motivation: There are some wrong links and tags in javadoc. Modifications: Fix the wrong links and tags in javadoc. Result: These links will work correctly in javadoc.	2015-12-26 08:34:31 +01:00
Scott Mitchell	f750d6e36c	ByteBufUtil.writeUtf8 Surrogate Support Motivation: UTF-16 can not represent the full range of Unicode characters, and thus has the concept of Surrogate Pair (http://unicode.org/glossary/#surrogate_pair) where 2 16-bit code units can be used to represent the missing characters. ByteBufUtil.writeUtf8 is currently does not support this and is thus incomplete. Modifications: - Add support for surrogate pairs in ByteBufUtil.writeUtf8 Result: ByteBufUtil.writeUtf8 now supports surrogate pairs and is correctly converting to UTF-8.	2015-12-18 13:51:52 -08:00
Norman Maurer	4e467f5c6f	Throw ReadOnlyBufferException if unsafe buffer is used and dst is direct Motivation: We missed to check if the dst is ready only before using unsafe to copy data into it which lead to data-corruption. We need to ensure we respect ready only ByteBuffer. Modifications: - Correctly check if the dst is ready only before copy data into it in UnsafeByteBufUtil - Also make it work for buffers that are not direct and not have an array Result: No more data corruption possible if the dst buffer is readonly and unsafe buffer implementation is used.	2015-12-17 13:25:21 +01:00
Fabian Lange	e5386b05e6	Move Hex dump related util from ByteBufUtil to inner class Motivation: Initialisation of the ByteBufUtil class, a class frequently used is delayed because a significant number of String operations is performed to fill a HEXDUMP_ROWPREFIXES array. This array also sticks to the Strings forever. It is quite likely that applications never use the hexdump facility. Modification: Moved the static initialisation and references to a static inner class. This delays initialisation (and memory usage) until actually needed. The API is kept as is. Result: Faster startup time, less memory usage for most netty using applications.	2015-12-11 19:47:57 +01:00
Alex Petrov	0f9492c9af	Add first-class Little Endian support to ByteBuf and descendants As discussed in #3209, this PR adds Little Endian accessors to ByteBuf and descendants. Corresponding accessors were added to UnsafeByteBufUtil, HeapByteBufferUtil to avoid calling `reverseBytes`. Deprecate `order()`, `order(buf)` and `SwappedByteBuf`.	2015-11-26 20:30:24 +01:00
Janecek Jakub	a4ebdd0eca	Fix setBytes on read-only ByteBuffer Motivation: The method setBytes did not work correctly because read-only ByteBuffer does not allow access to its underlying array. Modifications: New case was added for ByteBuffer's that are not direct and do not have an array. These must be handled by copying the data into a temporary array. Unit test was added to test this case. Result: It is now possible to use read-only ByteBuffer as the source for the setBytes method.	2015-11-17 22:45:37 -08:00
Sylwester Lachiewicz	a18416df60	Export defaults from PooledByteBufAcclocator static fields Motivation: Allow external application to tune initialization of PooledByteBufAllocator Modifications: Added new static methods Result: Exported DEFAULT_NUM_HEAP_ARENA DEFAULT_NUM_DIRECT_ARENA DEFAULT_PAGE_SIZE DEFAULT_MAX_ORDER DEFAULT_TINY_CACHE_SIZE DEFAULT_SMALL_CACHE_SIZE DEFAULT_NORMAL_CACHE_SIZE	2015-11-08 08:08:23 +01:00
Scott Mitchell	19658e9cd8	HTTP/2 Headers Type Updates Motivation: The HTTP/2 RFC (https://tools.ietf.org/html/rfc7540#section-8.1.2) indicates that header names consist of ASCII characters. We currently use ByteString to represent HTTP/2 header names. The HTTP/2 RFC (https://tools.ietf.org/html/rfc7540#section-10.3) also eludes to header values inheriting the same validity characteristics as HTTP/1.x. Using AsciiString for the value type of HTTP/2 headers would allow for re-use of predefined HTTP/1.x values, and make comparisons more intuitive. The Headers<T> interface could also be expanded to allow for easier use of header types which do not have the same Key and Value type. Motivation: - Change Headers<T> to Headers<K, V> - Change Http2Headers<ByteString> to Http2Headers<CharSequence, CharSequence> - Remove ByteString. Having AsciiString extend ByteString complicates equality comparisons when the hash code algorithm is no longer shared. Result: Http2Header types are more representative of the HTTP/2 RFC, and relationship between HTTP/2 header name/values more directly relates to HTTP/1.x header names/values.	2015-10-30 15:29:44 -07:00
Norman Maurer	a47685b243	Use bitwise operation when sampling for resource leak detection. Motivation: Modulo operations are slow, we can use bitwise operation to detect if resource leak detection must be done while sampling. Modifications: - Ensure the interval is a power of two - Use bitwise operation for sampling - Add benchmark. Result: Faster sampling.	2015-10-29 19:18:44 +01:00
Norman Maurer	d93f906255	[#4198 ] Fix race-condition when allocate from multiple-thread. Motivation: Fix a race condition that was introduced by `f18990a8a5` that could lead to a NPE when allocate from the PooledByteBufAllocator concurrently by many threads. Modifications: Correctly synchronize on the PoolSubPage head. Result: No more race.	2015-10-27 07:39:42 +01:00
Norman Maurer	2be4bb74a5	Share code between Unsafe ByteBuf implementations Motiviation: We have a lot of duplicated code which makes it hard to maintain. Modification: Move shared code to UnsafeByteBufUtil and use it in the implementations. Result: Less duplicated code and so easier to maintain.	2015-10-23 12:05:01 +02:00
Norman Maurer	992f8db0b4	Share code between Heap ByteBuf implementations Motiviation: We have a lot of duplicated code which makes it hard to maintain. Modification: Move shared code to HeapByteBufUtil and use it in the implementations. Result: Less duplicated code and so easier to maintain.	2015-10-23 11:53:34 +02:00
Norman Maurer	7d4c077492	Add *UnsafeHeapByteBuf for improve performance on systems with sun.misc.Unsafe Motivation: sun.misc.Unsafe allows us to handle heap ByteBuf in a more efficient matter. We should use special ByteBuf implementation when sun.misc.Unsafe can be used to increase performance. Modifications: - Add PooledUnsafeHeapByteBuf and UnpooledUnsafeHeapByteBuf that are used when sun.misc.Unsafe is ready to use. - Add UnsafeHeapSwappedByteBuf Result: Better performance when using heap buffers and sun.misc.Unsafe is ready to use.	2015-10-21 09:04:13 +02:00
Norman Maurer	f30a51b905	Correctly handle byte shifting if system does not support unaligned access. Motivation: We had a bug in our implemention which double "reversed" bytes on systems which not support unaligned access. Modifications: - Correctly only reverse bytes if needed. - Share code between unsafe implementations. Result: No more data-corruption on sytems without unaligned access.	2015-10-20 17:32:13 +02:00
Matteo Merli	27c68647df	In (Pooled\|Unpooled)UnsafeDirectByteBuf copy memory directly to and from ByteBuffer Motivation: When moving bytes between a PooledUnsafeDirectByteBuf or an UnpooledUnsafeDirectByteBuf and a ByteBuffer, a temp ByteBuffer is allocated and will need to be GCed. This is a common case since a ByteBuffer is always needed when reading/writing on a file, for example. Modifications: Use PlatformDependent.copyMemory() to avoid the need for the temp ByteBuffer Result: No temp ByteBuffer allocated and GCed.	2015-10-19 22:24:07 +02:00
Norman Maurer	5a6238ed4c	Minimize reference count checks in SlicedByteBuf Motivation: SlicedByteBuf did double reference count checking for various bulk operations, which affects performance. Modifications: - Add package private method to AbstractByteBuf that can be used to check indexes without check the reference count - Use this new method in the bulk operation os SlicedByteBuf as the reference count checks take place on the wrapped buffer anyway - Fix test-case to not try to read data that is out of the bounds of the buffer. Result: Better performance on bulk operations when using SlicedByteBuf (and sub-classes)	2015-10-16 21:09:03 +02:00
Norman Maurer	8f13e333dd	Always return a real slice even when the length is 0 Motivation: We need to always return a real slice even when the requested length is 0. This is needed as otherwise we not correctly share the reference count and so may leak a buffer if the user call release() on the returned slice and expect it to decrement the reference count of the "parent" buffer. Modifications: - Always return a real slice - Add unit test for the bug. Result: No more leak possible when a user requests a slice of length 0 of a SlicedByteBuf.	2015-10-16 20:46:05 +02:00
Norman Maurer	4c287d4e27	Added SlicedAbstractByteBuf that can provide fast-path for _get* and _set* methods Motivation: SlicedByteBuf can be used for any ByteBuf implementations and so can not do any optimizations that could be done when AbstractByteBuf is sliced. Modifications: - Add SlicedAbstractByteBuf that can eliminate range and reference count checks for _get* and _set* methods. Result: Faster SlicedByteBuf implementations for AbstractByteBuf sub-classes.	2015-10-16 09:12:20 +02:00
Norman Maurer	8c93f4b1ef	Added DuplicatedAbstractByteBuf that can provide fast-path for _get* and _set* methods Motivation: DuplicatedByteBuf can be used for any ByteBuf implementations and so can not do any optimizations that could be done when AbstractByteBuf is duplicated. Modifications: - Add DuplicatedAbstractByteBuf that can eliminate range and reference count checks for _get* and _set* methods. Result: Faster DuplicatedByteBuf implementations for AbstractByteBuf sub-classes.	2015-10-16 08:56:35 +02:00
Norman Maurer	2aef4a504f	Minimize object allocation when calling AbstractByteBuf.toString(..., Charset) Motivation: Calling AbstractByteBuf.toString(..., Charset) is used quite frequently by users but produce a lot of GC. Modification: - Use a FastThreadLocal to store the CharBuffer that are needed for decoding. - Use internalNioBuffer(...) when possible Result: Less object creation / Less GC	2015-10-15 17:51:57 +02:00
Norman Maurer	9697afc106	Allow to disable reference count checks on every access of the ByteBuf Motiviation: Checking reference count on every access on a ByteBuf can have some big performance overhead depending on how the access pattern is. If the user is sure that there are no reference count errors on his side it should be possible to disable the check and so gain the max performance. Modification: - Add io.netty.buffer.bytebuf.checkAccessible system property which allows to disable the checks. Enabled by default. - Add microbenchmark Result: Increased performance for operations on the ByteBuf.	2015-10-15 10:21:16 +02:00
Norman Maurer	ffe7aafd82	Optimize and minimize bound checks Motivation: We should minimize and optimize bound checks as much as possible to get the most out of performance. Modifications: - Use bitwise operations to remove branching - Remove branches when possible Result: Better performance for various operations.	2015-10-15 10:18:13 +02:00
Norman Maurer	30dc1c1fa4	[#4313 ] ByteBufUtil.writeUtf8 should use fast-path for WrappedByteBuf Motivation: ByteBufUtil.writeUtf8(...) / writeUsAscii(...) can use a fast-path when writing into AbstractByteBuf. We should try to unwrap WrappedByteBuf implementations so we are able to do the same on wrapped AbstractByteBuf instances. Modifications: - Try to unwrap WrappedByteBuf to use the fast-path Result: Faster writing of utf8 and usascii for WrappedByteBuf instances.	2015-10-13 12:00:37 +02:00
Norman Maurer	99b11c95b4	[#4327 ] Ensure toString() will not throw IllegalReferenceCountException Motivation: As toString() is often used while logging we need to ensure this produces no exception. Modifications: Ensure we never throw an IllegalReferenceCountException. Result: Be able to log without produce exceptions.	2015-10-10 20:12:43 +02:00
Norman Maurer	956a757d37	[#3789 ] Correctly reset markers for all allocations when using PooledByteBufAllocator Motivation: We need to ensure all markers are reset when doing an allocation via the PooledByteBufAllocator. This was not the always the case. Modifications: Move all logic that needs to get executed when reuse a PooledByteBuf into one place and call it. Result: Correct behavior	2015-09-25 19:57:33 +02:00
Norman Maurer	cac51ab8d6	Optimize ByteBufUtil.writeUsAscii(...) when AsciiString is used. Motivation: When AsciiString is used we can optimize the write operation done by ByteBufUtil.writeUsAscii(...) Modifications: Sepcial handle AsciiString. Result: Faster writing of AsciiString.	2015-09-15 12:26:58 +02:00
Matteo Merli	7a9a3159f9	Added debug logging with effective value for io.netty.leakDetection.acquireAndReleaseOnly property Motivation: The configurable property value recently added was not logged like others properties. Modifications: Added debug log with effective value applied. Result: Consistent with other properties	2015-09-01 09:10:38 +02:00
Matteo Merli	9b45e9d015	Additional configuration for leak detection Motivation: Leak detector, when it detects a leak, will print the last 5 stack traces that touched the ByteBuf. In some cases that might not be enough to identify the root cause of the leak. Also, sometimes users might not be interested in tracing all the operations on the buffer, but just the ones that are affecting the reference count. Modifications: Added command line properties to override default values: * Allow to configure max number of stack traces to collect * Allow to only record retain/release operation on buffers Result: Users can increase the number of stack traces to debug buffer leaks with lot of retain/release operations.	2015-08-30 20:50:10 +02:00
Scott Mitchell	4bdd8dacb9	Restore derived buffer index/mark updates Motivation: As part of the revert process in https://github.com/netty/netty/pull/4138 some index and mark updates were lost. Modifications: - Restore the index / mark updates made in https://github.com/netty/netty/pull/3788 Result: Slice and Duplicate buffers index / marks are correctly initialized.	2015-08-27 10:25:15 -07:00
Scott Mitchell	e280251b15	Revert "Add PooledSlicedByteBuf and PooledDuplicatedByteBuf" Motivation: Currently the "derived" buffer will only ever be recycled if the release call is made on the "derived" object, and the "wrapped" buffer ends up being "fully released" (aka refcount goes to 0). From my experience this is not the common use case and thus the "derived" buffers will not be recycled. Modifications: - revert https://github.com/netty/netty/pull/3788 Result: Less complexity, and less code to create new objects in majority of cases.	2015-08-26 13:24:44 -07:00
Matteo Merli	0be53f296f	MemoryRegionCache$Entry objects are not recycled Motivation: Even though MemoryRegionCache$Entry instances are allocated through a recycler they are not properly recycled, leaving a lot of instances to be GCed along with Recycler$DefaultHandle objects. Fixes #4071 Modification: Recycle Entry when done using it. Result: Less GCed objects.	2015-08-10 21:29:25 +02:00
Scott Mitchell	cf171ff525	maxBytesPerRead channel configuration Motiviation: The current read loops don't fascilitate reading a maximum amount of bytes. This capability is useful to have more fine grain control over how much data is injested. Modifications: - Add a setMaxBytesPerRead(int) and getMaxBytesPerRead() to ChannelConfig - Add a setMaxBytesPerIndividualRead(int) and getMaxBytesPerIndividualRead to ChannelConfig - Add methods to RecvByteBufAllocator so that a pluggable scheme can be used to control the behavior of the read loop. - Modify read loop for all transport types to respect the new RecvByteBufAllocator API Result: The ability to control how many bytes are read for each read operation/loop, and a more extensible read loop.	2015-08-05 23:59:54 -07:00
Norman Maurer	ae163d687d	[#3896 ] Unpooled.copiedBuffer(ByteBuffer) and copiedBuffer(ByteBuffer...) is not thread-safe. Motivation: As we modify the position of the passed in ByteBuffer's this methods are not thread-safe. Modifications: Duplicate the input ByteBuffers before copy the content to byte[]. Result: Unpooled.copiedBuffer(ByteBuffer) and copiedBuffer(ByteBuffer...) is now thread-safe.	2015-07-07 08:38:37 +02:00
Norman Maurer	8c90d602d7	[#3899 ] Fix javadoc to use netty 4 API. Motivation: The javadoc of ByteBuf contained some out-dated code. Modifications: Update code example in javadoc to use netty 4+ API Result: Correct javadocs	2015-07-03 14:18:53 +02:00
Louis Ryan	ba6319eb6c	Fix FixedCompositeByteBuf handling when copying to direct buffers and streams Motivation: FixedCompositeByteBuf does not properly implement a number of methods for copying its content to direct buffers and output streams Modifications: Replace improper use of capacity() with readableBytes() when computing offesets during writes Result: Copying works correctly	2015-06-27 20:39:31 +02:00
Norman Maurer	e0ef01cf93	[#3888 ] Use 2 * cores as default minimum for pool arenas. Motivation: At the moment we use 1 * cores as default mimimum for pool arenas. This can easily lead to conditions as we use 2 * cores as default for EventLoop's when using NIO or EPOLL. If we choose a smaller number we will run into hotspots as allocation and deallocation needs to be synchronized on the PoolArena. Modifications: Change the default number of arenas to 2 * cores. Result: Less conditions when using the default settings.	2015-06-18 07:27:30 +02:00
Norman Maurer	4570f30dd9	[#3798 ] Extract dump method to ByteBufUtil Motivation: Dumping the content of a ByteBuf in a hex format is very useful. Modifications: Move code into ByteBufUtil so its easy to reuse. Result: Easy to reuse dumping code.	2015-06-09 06:21:09 +02:00
Norman Maurer	6f9eb2cd34	Update javadocs to highlight that derived buffers will not increment the reference count. Motivation: We not explain the derived buffers will not retain the parent buffer. Modifications: Add docs. Result: Correctly document behaviour	2015-06-06 19:54:38 +02:00
Norman Maurer	9f5a3e553c	Fix regression introduced by `f765053ae7` by use Entry after it is recycled	2015-05-27 16:56:20 +02:00
Norman Maurer	81fee66c78	Let PoolThreadCache work even if allocation and deallocation Thread are different Motivation: PoolThreadCache did only cache allocations if the allocation and deallocation Thread were the same. This is not optimal as often people write from differen thread then the actual EventLoop thread. Modification: - Add MpscArrayQueue which was forked from jctools and lightly modified. - Use MpscArrayQueue for caches and always add buffer back to the cache that belongs to the allocation thread. Result: ThreadPoolCache is now also usable and so gives performance improvements when allocation and deallocation thread are different. Performance when using same thread for allocation and deallocation is noticable worse then before.	2015-05-27 14:38:11 +02:00
Norman Maurer	ab1bb9136b	[#3654 ] Synchronize on PoolSubpage head when allocate / free PoolSubpages Motivation: Currently we hold a lock on the PoolArena when we allocate / free PoolSubpages, which is wasteful as this also affects "normal" allocations. The same is true vice-verse. Modifications: Ensure we synchronize on the head of the PoolSubPages pool. This is done per size and so it is possible to concurrently allocate / deallocate PoolSubPages with different sizes, and also normal allocations. Result: Less condition and so faster allocation/deallocation. Before this commit: xxx:~/wrk $ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 17.61ms 29.52ms 689.73ms 97.27% Req/Sec 278.93k 41.97k 351.04k 84.83% 530527460 requests in 2.00m, 71.64GB read Requests/sec: 4422226.13 Transfer/sec: 611.52MB After this commit: xxx:~/wrk $ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 15.85ms 24.50ms 681.61ms 97.42% Req/Sec 287.14k 38.39k 360.33k 85.88% 547902773 requests in 2.00m, 73.99GB read Requests/sec: 4567066.11 Transfer/sec: 631.55MB This is reproducable every time.	2015-05-27 10:33:12 +02:00
Norman Maurer	dce0dd9b78	[#3654 ] No need to hold lock while destroy a chunk Motiviation: At the moment we sometimes hold the lock on the PoolArena during destroy a PoolChunk. This is not needed. Modification: - Ensure we not hold the lock during destroy a PoolChunk - Move all synchronized usage in PoolArena - Cleanup Result: Less condition.	2015-05-27 09:47:53 +02:00
Norman Maurer	271af7c624	Expose metrics for PooledByteBufAllocator Motivation: The PooledByteBufAllocator is more or less a black-box atm. We need to expose some metrics to allow the user to get a better idea how to tune it. Modifications: - Expose different metrics via PooledByteBufAllocator - Add *Metrics interfaces Result: It is now easy to gather metrics and detail about the PooledByteBufAllocator and so get a better understanding about resource-usage etc.	2015-05-20 21:06:17 +02:00
Norman Maurer	88b8558ec8	Add PooledSlicedByteBuf and PooledDuplicatedByteBuf Motivation: At the moment when calling slice(...) or duplicate(...) on a Pooled*ByteBuf a new SlicedByteBuf or DuplicatedByteBuf. This can create a lot of GC. Modifications: Add PooledSlicedByteBuf and PooledDuplicatedByteBuf which will be used when a PooledByteBuf is used. Result: Less GC.	2015-05-20 11:37:50 +02:00
Norman Maurer	33e443e71a	Clarify ByteBuf.duplicate() semantics. Motivation: From the javadocs of ByteBuf.duplicate() it is not clear if the reader and writer marks will be duplicated. Modifications: Add sentence to clarify that marks will not be duplicated. Result: Clear semantics.	2015-05-20 08:32:52 +02:00
Norman Maurer	9d568586db	Reset markers when obtain PooledByteBuf. Motivation: When allocate a PooledByteBuf we need to ensure to also reset the markers for the readerIndex and writerIndex. Modifications: - Correct reset the markers - Add test-case for it Result: Correctly reset markers.	2015-05-20 07:29:32 +02:00
Norman Maurer	92bfeeca1b	No need to release lock and acquire again when allocate normal size. Motiviation: When tried to allocate tiny and small sized and failed to serve these out of the PoolSubPage we exit the synchronization block just to enter it again when call allocateNormal(...). Modification: Not exit the synchronized block until allocateNormal(...) is done. Result: Better performance.	2015-05-20 07:29:32 +02:00
Alwayswithme	79c17cf1fd	ByteBufUtil use IndexOfProcessor to find occurrence. Motivation: The way of firstIndexOf and lastIndexOf iterating the ByteBuf is similar to forEachByte and forEachByteDesc, but have many range checks. Modifications: Use forEachByte and a IndexOfProcessor to find occurrence. Result: eliminate range checks	2015-05-07 09:36:47 +02:00
Scott Mitchell	f812180c2d	ByteString arrayOffset method Motivation: The ByteString class currently assumes the underlying array will be a complete representation of data. This is limiting as it does not allow a subsection of another array to be used. The forces copy operations to take place to compensate for the lack of API support. Modifications: - add arrayOffset method to ByteString - modify all ByteString and AsciiString methods that loop over or index into the underlying array to use this offset - update all code that uses ByteString.array to ensure it accounts for the offset - add unit tests to test the implementation respects the offset Result: ByteString and AsciiString can represent a sub region of a byte[].	2015-04-24 18:54:01 -07:00
Norman Maurer	8b1f247a1a	[#3623 ] CompositeByteBuf.iterator() should return optimized Iterable Motivation: CompositeByteBuf.iterator() currently creates a new ArrayList and fill it with the ByteBufs, which is more expensive then it needs to be. Modifications: - Use special Iterator implementation Result: Less overhead when calling iterator()	2015-04-20 10:45:37 +02:00
Scott Mitchell	9a7a85dbe5	ByteString introduced as AsciiString super class Motivation: The usage and code within AsciiString has exceeded the original design scope for this class. Its usage as a binary string is confusing and on the verge of violating interface assumptions in some spots. Modifications: - ByteString will be created as a base class to AsciiString. All of the generic byte handling processing will live in ByteString and all the special character encoding will live in AsciiString. Results: The AsciiString interface will be clarified. Users of AsciiString can now be clear of the limitations the class imposes while users of the ByteString class don't have to live with those limitations.	2015-04-14 16:35:17 -07:00
garywu	4d02c3a040	[#2925 ] Bug fix for NormalMemoryRegionCache overbooked for PoolThreadCache Motivation: When create NormalMemoryRegionCache for PoolThreadCache, we overbooked cache array size. This means unnecessary overhead for thread local cache as we will create multi cache enties for each element in cache array. Modifications: change: int arraySize = Math.max(1, max / area.pageSize); to: int arraySize = Math.max(1, log2(max / area.pageSize) + 1); Result: Now arraySize won't introduce unnecessary overhead. Changes to be committed: modified: buffer/src/main/java/io/netty/buffer/PoolThreadCache.java	2015-04-13 09:02:10 +02:00
Norman Maurer	18627749a9	Let CompositeByteBuf implement Iterable Motivation: CompositeByteBuf has an iterator() method but fails to implement Iterable Modifications: Let CompositeByteBuf implement Iterable<ByteBuf> Result: Easier usage	2015-04-12 13:38:27 +02:00
Norman Maurer	d8e5d421e1	Revert "Dereference when calling PooledByteBuf.deallocate()" This reverts commit `7094c7b797`.	2015-04-11 06:44:32 +02:00
Norman Maurer	7094c7b797	Dereference when calling PooledByteBuf.deallocate() Motivation: We missed to dereference the chunk and tmpNioBuf when calling deallocate(). This means the GC can not collect these as we still hold a reference while have the PooledByteBuf in the recycler stack. Modifications: Dereference chunk and tmpNioBuf. Result: GC can collect things.	2015-04-10 21:47:14 +02:00
Norman Maurer	3e42292d8b	Change PoolThreadCache to use LIFO for better cache performance Motiviation: At the moment we use FIFO for the PoolThreadCache which is sub-optimal as this may reduce the changes to have the cached memory actual still in the cpu-cache. Modification: - Change to use LIFO as this increase the chance to be able to serve buffers from the cpu-cache Results: Faster allocation out of the ThreadLocal cache. Before the commit: [xxx wrk]$ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 14.69ms 10.06ms 131.43ms 80.10% Req/Sec 283.89k 40.37k 433.69k 66.81% 533859742 requests in 2.00m, 72.09GB read Requests/sec: 4449510.51 Transfer/sec: 615.29MB After the commit: [xxx wrk]$ ./wrk -H 'Connection: keep-alive' -d 120 -c 256 -t 16 -s scripts/pipeline-many.lua http://xxx:8080/plaintext Running 2m test @ http://xxx:8080/plaintext 16 threads and 256 connections Thread Stats Avg Stdev Max +/- Stdev Latency 16.38ms 26.32ms 734.06ms 97.38% Req/Sec 283.86k 39.31k 361.69k 83.38% 540836511 requests in 2.00m, 73.04GB read Requests/sec: 4508150.18 Transfer/sec: 623.40MB	2015-04-10 20:57:54 +02:00
Scott Mitchell	0d3a6e0511	HTTP/2 Decoder reduce preface conditional checks Motivation: The DefaultHttp2ConnectionDecoder class is calling verifyPrefaceReceived() for almost every frame event at all times. The Http2ConnectionHandler class is calling readClientPrefaceString() on every decode event. Modifications: - DefaultHttp2ConnectionDecoder should not have to continuously call verifyPrefaceReceived() because it transitions boolean state 1 time for each connection. - Http2ConnectionHandler should not have to continuously call readClientPrefaceString() because it transitions boolean state 1 time for each connection. Result: - Less conditional checks for the mainstream usage of the connection.	2015-03-28 18:52:35 -07:00
Leo Gomes	4500adb6e0	Updates the javadoc of Unpooled to remove mention to methods it does not provide Motivation: `Unpooled` javadoc's mentioned the generation of hex dump and swapping an integer's byte order, which are actually provided by `ByteBufUtil`. Modifications: Sentence moved to `ByteBufUtil` javadoc. Result: `Unpooled` javadoc is correct.	2015-03-04 12:04:14 +09:00
Norman Maurer	41fd857a7c	Ensure CompositeByteBuf.addComponent* handles buffer in consistent way and not causes leaks Motivation: At the moment we have two problems: - CompositeByteBuf.addComponent(...) will not add the supplied buffer to the CompositeByteBuf if its empty, which means it will not be released on CompositeByteBuf.release() call. This is a problem as a user will expect everything added will be released (the user not know we not added it). - CompositeByteBuf.addComponents(...) will either add no buffers if none is readable and so has the same problem as addComponent(...) or directly release the ByteBuf if at least one ByteBuf is readable. Again this gives inconsistent handling and may lead to memory leaks. Modifications: - Always add the buffer to the CompositeByteBuf and so release it on release call. Result: Consistent handling and no buffer leaks.	2015-02-12 16:09:41 +01:00
Trustin Lee	155c0e2f36	Implement internal memory access methods of CompositeByteBuf correctly Motivation: When a CompositeByteBuf is empty (i.e. has no component), its internal memory access operations do not always behave as expected. Modifications: Check if the nunmber of components is zero. If so, return an empty array or an empty NIO buffer, etc. Result: More robustness	2014-12-30 15:56:53 +09:00
Trustin Lee	a666acce6d	Add more tests to EmptyByteBufTest - Ensure an EmptyByteBuf has an array, an NIO buffer, and a memory address at the same time - Add an assertion that checks if EMPTY_BUFFER is an EmptyByteBuf, just in case we make a mistake in the future	2014-12-30 15:51:45 +09:00
Norman Maurer	fe796fc8ab	Provide helper methods in ByteBufUtil to write UTF-8/ASCII CharSequences. Related to [#909 ] Motivation: We expose no methods in ByteBuf to directly write a CharSequence into it. This leads to have the user either convert the CharSequence first to a byte array or use CharsetEncoder. Both cases have some overheads and we can do a lot better for well known Charsets like UTF-8 and ASCII. Modifications: Add ByteBufUtil.writeAscii(...) and ByteBufUtil.writeUtf8(...) which can do the task in an optimized way. This is especially true if the passed in ByteBuf extends AbstractByteBuf which is true for all of our implementations which not wrap another ByteBuf. Result: Writing an ASCII and UTF-8 CharSequence into a AbstractByteBuf is a lot faster then what the user could do by himself as we can make use of some package private methods and so eliminate reference and range checks. When the Charseq is not ASCII or UTF-8 we can still do a very good job and are on par in most of the cases with what the user would do. The following benchmark shows the improvements: Result: 2456866.966 ?(99.9%) 59066.370 ops/s [Average] Statistics: (min, avg, max) = (2297025.189, 2456866.966, 2586003.225), stdev = 78851.914 Confidence interval (99.9%): [2397800.596, 2515933.336] Benchmark Mode Samples Score Score error Units i.n.m.b.ByteBufUtilBenchmark.writeAscii thrpt 50 9398165.238 131503.098 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiString thrpt 50 9695177.968 176684.821 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiStringViaArray thrpt 50 4788597.415 83181.549 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiStringViaArrayWrapped thrpt 50 4722297.435 98984.491 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiStringWrapped thrpt 50 4028689.762 66192.505 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiViaArray thrpt 50 3234841.565 91308.009 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiViaArrayWrapped thrpt 50 3311387.474 39018.933 ops/s i.n.m.b.ByteBufUtilBenchmark.writeAsciiWrapped thrpt 50 3379764.250 66735.415 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8 thrpt 50 5671116.821 101760.081 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8String thrpt 50 5682733.440 111874.084 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8StringViaArray thrpt 50 3564548.995 55709.512 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8StringViaArrayWrapped thrpt 50 3621053.671 47632.820 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8StringWrapped thrpt 50 2634029.071 52304.876 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8ViaArray thrpt 50 3397049.332 57784.119 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8ViaArrayWrapped thrpt 50 3318685.262 35869.562 ops/s i.n.m.b.ByteBufUtilBenchmark.writeUtf8Wrapped thrpt 50 2473791.249 46423.114 ops/s Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 1,387.417 sec - in io.netty.microbench.buffer.ByteBufUtilBenchmark Results : Tests run: 1, Failures: 0, Errors: 0, Skipped: 0 Results : Tests run: 1, Failures: 0, Errors: 0, Skipped: 0 The ViaArray benchmarks are basically doing a toString().getBytes(Charset) which the others are using ByteBufUtil.write*(...).	2014-12-26 15:58:18 +09:00
Norman Maurer	66294892a0	CompositeByteBuf.nioBuffers(...) must not return an empty ByteBuffer array Motivation: CompositeByteBuf.nioBuffers(...) returns an empty ByteBuffer array if the specified length is 0. This is not consistent with other ByteBuf implementations which return an ByteBuffer array of size 1 with an empty ByteBuffer included. Modifications: Make CompositeByteBuf.nioBuffers(...) consistent with other ByteBuf implementations. Result: Consistent and correct behaviour of nioBufffers(...)	2014-12-22 11:18:32 +01:00
Norman Maurer	a69a39c849	Always return SliceByteBuf on slice(...) to eliminate possible leak Motivation: When calling slice(...) on a ByteBuf the returned ByteBuf should be the slice of a ByteBuf and shares it's reference count. This is important as it is perfect legal to use buf.slice(...).release() and have both, the slice and the original ByteBuf released. At the moment this is only the case if the requested slice size is > 0. This makes the behavior inconsistent and so may lead to a memory leak. Modifications: - Never return Unpooled.EMPTY_BUFFER when calling slice(...). - Adding test case for buffer.slice(...).release() and buffer.duplicate(...).release() Result: Consistent behaviour and so no more leaks possible.	2014-12-22 11:15:50 +01:00
Norman Maurer	182c91f06c	Ensure buffer is not released when call array() / memoryAddress() Motivation: Before we missed to check if a buffer was released before we return the backing byte array or memoryaddress. This could lead to JVM crashes when someone tried various bulk operations on the UnsafeByteBuf implementations. Modifications: Always check if the buffer is released before all to return the byte array and memoryaddress. Result: No more JVM crashes because of released buffers when doing bulk operations on UnsafeByteBuf implementations.	2014-12-11 11:30:31 +01:00
Idel Pivnitskiy	35db3c6710	Small performance improvements Motivation: Found performance issues via FindBugs and PMD. Modifications: - Removed unnecessary boxing/unboxing operations in DefaultTextHeaders.convertToInt(CharSequence) and DefaultTextHeaders.convertToLong(CharSequence). A boxed primitive is created from a string, just to extract the unboxed primitive value. - Added a static modifier for DefaultHttp2Connection.ParentChangedEvent class. This class is an inner class, but does not use its embedded reference to the object which created it. This reference makes the instances of the class larger, and may keep the reference to the creator object alive longer than necessary. - Added a static compiled Pattern to avoid compile it each time it is used when we need to replace some part of authority. - Improved using of StringBuilders. Result: Performance improvements.	2014-11-20 00:10:06 -05:00
Norman Maurer	48f1398869	Disable caching of PooledByteBuf for different threads. Motivation: We introduced a PoolThreadCache which is used in our PooledByteBufAllocator to reduce the synchronization overhead on PoolArenas when allocate / deallocate PooledByteBuf instances. This cache is used for both the allocation path and deallocation path by: - Look for cached memory in the PoolThreadCache for the Thread that tries to allocate a new PooledByteBuf and if one is found return it. - Add the memory that is used by a PooledByteBuf to the PoolThreadCache of the Thread that release the PooledByteBuf This works out very well when all allocation / deallocation is done in the EventLoop as the EventLoop will be used for read and write. On the otherside this can lead to surprising side-effects if the user allocate from outside the EventLoop and and pass the ByteBuf over for writing. The problem here is that the memory will be added to the PoolThreadCache that did the actual write on the underlying transport and not on the Thread that previously allocated the buffer. Modifications: Don't cache if different Threads are used for allocating/deallocating Result: Less confusing behavior for users that allocate PooledByteBufs from outside the EventLoop.	2014-09-22 13:39:31 +02:00
Norman Maurer	858de5699b	[#2924 ] Correctly update head in MemoryRegionCache.trim() Motivation: When MemoryRegionCache.trim() is called, some unused cache entries will be freed (started from head). However, in MeoryRegionCache.trim() the head is not updated, which make entry list's head point to an entry whose chunk is null now and following allocate of MeoryRegionCache will return false immediately. In other word, cache is no longer usable once trim happen. Modifications: Update head to correct idx after free entries in trim(). Result: MemoryRegionCache behaves correctly even after calling trim().	2014-09-22 11:04:21 +02:00
Trustin Lee	0dc6a8dccf	Use heap buffers for Unpooled.copiedBuffer() Related issue: #2028 Motivation: Some copiedBuffer() methods in Unpooled allocated a direct buffer. An allocation of a direct buffer is an expensive operation, and thus should be avoided for unpooled buffers. Modifications: - Use heap buffers in all copiedBuffer() methods Result: Unpooled.copiedBuffers() are less expensive now.	2014-08-13 15:10:11 -07:00
Norman Maurer	ef572d859d	Change back default allocator to pooled. Motivation: While porting some changes from 4.0 to 4.1 and master branch I changed the default allocator from pooled to unpooled by mistake. This should be reverted. The guilty commit is `4a3ef90381`. Thanks to @blucas for spotting this. Modifications: Revert changes related to allocator. Result: Use the correct default allocator again.	2014-08-13 12:07:06 +02:00
Norman Maurer	869687bd71	Port ChannelOutboundBuffer and related changes from 4.0 Motivation: We did various changes related to the ChannelOutboundBuffer in 4.0 branch. This commit port all of them over and so make sure our branches are synced in terms of these changes. Related to [#2734], [#2709], [#2729], [#2710] and [#2693] . Modification: Port all changes that was done on the ChannelOutboundBuffer. This includes the port of the following commits: - `73dfd7c01b` - `997d8c32d2` - `e282e504f1` - `5e5d1a58fd` - `8ee3575e72` - `d6f0d12a86` - `16e50765d1` - `3f3e66c31a` Result: - Less memory usage by ChannelOutboundBuffer - Same code as in 4.0 branch - Make it possible to use ChannelOutboundBuffer with Channel implementation that not extends AbstractChannel	2014-08-05 15:00:45 +02:00
Trustin Lee	9a654d8a61	Remove duplicate range check in AbstractByteBuf.skipBytes()	2014-07-29 15:58:28 -07:00
Idel Pivnitskiy	ad1389be9d	Small performance improvements Modifications: - Added a static modifier for CompositeByteBuf.Component. This class is an inner class, but does not use its embedded reference to the object which created it. This reference makes the instances of the class larger, and may keep the reference to the creator object alive longer than necessary. - Removed unnecessary boxing/unboxing operations in HttpResponseDecoder, RtspResponseDecoder, PerMessageDeflateClientExtensionHandshaker and PerMessageDeflateServerExtensionHandshaker A boxed primitive is created from a String, just to extract the unboxed primitive value. - Removed unnecessary 3 times calculations in DiskAttribute.addContent(...). - Removed unnecessary checks if file exists before call mkdirs() in NativeLibraryLoader and PlatformDependent. Because the method mkdirs() has this check inside. - Removed unnecessary `instanceof AsciiString` check in StompSubframeAggregator.contentLength(StompHeadersSubframe) and StompSubframeDecoder.getContentLength(StompHeaders, long). Because StompHeaders.get(CharSequence) always returns java.lang.String.	2014-07-20 09:26:04 +02:00
Norman Maurer	f88dfd0430	[#2653 ] Remove unnecessary ensureAccessible() calls Motivation: I introduced ensureAccessible() class as part of `6c47cc9711` in some places. Unfortunally I also added some where these are not needed and so caused a performance regression. Modification: Remove calls where not needed. Result: Fixed performance regression.	2014-07-14 21:04:12 +02:00
Norman Maurer	93c306602a	[#2653 ] Remove uncessary range checks for performance reasons Motivation: I introduced range checks as part of `6c47cc9711` in some places. Unfortunally I also added some where these are not needed and so caused a performance regression. Modification: Remove range checks where not needed Result: Fixed performance regression.	2014-07-14 11:43:19 +02:00
Brendt Lucas	ac8ac59148	[#2642 ] CompositeByteBuf.deallocate memory/GC improvement Motivation: CompositeByteBuf.deallocate generates unnecessary GC pressure when using the 'foreach' loop, as a 'foreach' loop creates an iterator when looping. Modification: Convert 'foreach' loop into regular 'for' loop. Result: Less GC pressure (and possibly more throughput) as the 'for' loop does not create an iterator	2014-07-08 21:08:14 +02:00
Trustin Lee	d0912f2709	Fix most inspector warnings Motivation: It's good to minimize potentially broken windows. Modifications: Fix most inspector warnings from our profile Update IntObjectHashMap Result: Cleaner code	2014-07-02 19:55:07 +09:00
Norman Maurer	9594a81b95	[#2622 ] Correctly check reference count before try to work on the underlying memory Motivation: Because of how we use reference counting we need to check for the reference count before each operation that touches the underlying memory. This is especially true as we use sun.misc.Cleaner.clean() to release the memory ASAP when possible. Because of this the user may cause a SEGFAULT if an operation is called that tries to access the backing memory after it was released. Modification: Correctly check the reference count on all methods that access the underlying memory or expose it via a ByteBuffer. Result: Safer usage of ByteBuf	2014-06-30 07:14:25 +02:00
Trustin Lee	c0462c0c3b	Optimize PoolChunk - Using short[] for memoryMap did not improve performance. Reverting back to the original dual-byte[] structure in favor of simplicity. - Optimize allocateRun() which yields small performence improvement - Use local variable when member fields are accessed more than once	2014-06-26 17:06:10 +09:00
Trustin Lee	dbc011c3f4	Fix inspector warnings	2014-06-26 17:06:10 +09:00
Pavan Kumar	69a6ad940a	Improve the allocation algorithm in PoolChunk Motivation: Depth-first search is not always efficient for buddy allocation. Modification: Employ a new faster search algorithm with different memoryMap layout. Result: With thread-local cache disabled, we see a lot of performance improvment, especially when the size of the allocation is as small as the page size, which had the largest search space previously.	2014-06-26 17:06:10 +09:00
Norman Maurer	12a3e23e47	MessageToByteEncoder always starts with ByteBuf that use initalCapacity == 0 Motivation: MessageToByteEncoder always starts with ByteBuf that use initalCapacity == 0 when preferDirect is used. This is really wasteful in terms of performance as every first write into the buffer will cause an expand of the buffer itself. Modifications: - Change ByteBufAllocator.ioBuffer() use the same default initialCapacity as heapBuffer() and directBuffer() - Add new allocateBuffer method to MessageToByteEncoder that allow the user to do some smarter allocation based on the message that will be encoded. Result: Less expanding of buffer and more flexibilty when allocate the buffer for encoding.	2014-06-24 13:55:21 +09:00
Trustin Lee	37b07a04d4	Revert "Improve the allocation algorithm in PoolChunk" This reverts commit `36305d7dce`, which seems to cause an assertion failure on our CI machine.	2014-06-21 19:19:35 +09:00
Pavan Kumar	6bd8c5d4d0	Improve the allocation algorithm in PoolChunk Motivation: Depth-first search is not always efficient for buddy allocation. Modification: Employ a new faster search algorithm with different memoryMap layout. Result: With thread-local cache disabled, we see a lot of performance improvment, especially when the size of the allocation is as small as the page size, which had the largest search space previously: -- master head -- Benchmark (size) Mode Score Error Units pooledDirectAllocAndFree 8192 thrpt 215.392 1.565 ops/ms pooledDirectAllocAndFree 16384 thrpt 594.625 2.154 ops/ms pooledDirectAllocAndFree 65536 thrpt 1221.520 18.965 ops/ms pooledHeapAllocAndFree 8192 thrpt 217.175 1.653 ops/ms pooledHeapAllocAndFree 16384 thrpt 587.250 14.827 ops/ms pooledHeapAllocAndFree 65536 thrpt 1217.023 44.963 ops/ms -- changes -- Benchmark (size) Mode Score Error Units pooledDirectAllocAndFree 8192 thrpt 3656.744 94.093 ops/ms pooledDirectAllocAndFree 16384 thrpt 4087.152 22.921 ops/ms pooledDirectAllocAndFree 65536 thrpt 4058.814 29.276 ops/ms pooledHeapAllocAndFree 8192 thrpt 3640.355 44.418 ops/ms pooledHeapAllocAndFree 16384 thrpt 4030.206 24.365 ops/ms pooledHeapAllocAndFree 65536 thrpt 4103.991 70.991 ops/ms	2014-06-21 13:20:25 +09:00
Norman Maurer	371f8066d2	[#2580 ] [#2587 ] Fix buffer corruption regression when ByteBuf.order(LITTLE_ENDIAN) is used Motivation: To improve the speed of ByteBuf with order LITTLE_ENDIAN and where the native order is also LITTLE_ENDIAN (intel) we introduces a new special SwappedByteBuf before in commit `4ad3984c8b`. Unfortunally the commit has a flaw which does not handle correctly the case when a ByteBuf expands. This was caused because the memoryAddress was cached and never changed again even if the underlying buffer expanded. This can lead to corrupt data or even to SEGFAULT the JVM if you are lucky enough. Modification: Always lookup the actual memoryAddress of the wrapped ByteBuf. Result: No more data-corruption for ByteBuf with order LITTLE_ENDIAN and no JVM crashes.	2014-06-20 18:24:44 +02:00
Trustin Lee	085a61a310	Refactor FastThreadLocal to simplify TLV management Motivation: When Netty runs in a managed environment such as web application server, Netty needs to provide an explicit way to remove the thread-local variables it created to prevent class loader leaks. FastThreadLocal uses different execution paths for storing a thread-local variable depending on the type of the current thread. It increases the complexity of thread-local removal. Modifications: - Moved FastThreadLocal and FastThreadLocalThread out of the internal package so that a user can use it. - FastThreadLocal now keeps track of all thread local variables it has initialized, and calling FastThreadLocal.removeAll() will remove all thread-local variables of the caller thread. - Added FastThreadLocal.size() for diagnostics and tests - Introduce InternalThreadLocalMap which is a mixture of hard-wired thread local variable fields and extensible indexed variables - FastThreadLocal now uses InternalThreadLocalMap to implement a thread-local variable. - Added ThreadDeathWatcher.unwatch() so that PooledByteBufAllocator tells it to stop watching when its thread-local cache has been freed by FastThreadLocal.removeAll(). - Added FastThreadLocalTest to ensure that removeAll() works - Added microbenchmark for FastThreadLocal and JDK ThreadLocal - Upgraded to JMH 0.9 Result: - A user can remove all thread-local variables Netty created, as long as he or she did not exit from the current thread. (Note that there's no way to remove a thread-local variable from outside of the thread.) - FastThreadLocal exposes more useful operations such as isSet() because we always implement a thread local variable via InternalThreadLocalMap instead of falling back to JDK ThreadLocal. - FastThreadLocalBenchmark shows that this change improves the performance of FastThreadLocal even more.	2014-06-19 21:13:55 +09:00
Norman Maurer	ad86ec798d	Move calculateNewCapacity(...) to ByteBufAllocator Motivation: Currently we have the algorithm of calculate the new capacity of a ByteBuf implemented in AbstractByteBuf. The problem with this is that it is impossible for a user to change it if it not fits well it's use-case. We should better move it to ByteBufAllocator and so let the user implement it's own by either write his/her own ByteBufAllocator or just override the default implementation in one of our provided ByteBufAllocators. Modifications: Move calculateNewCapacity(...) to ByteBufAllocator and move the implementation (which was part of AbstractByteBuf) to AbstractByteBufAllocator. Result: The user can now override the default calculation algorithm when needed.	2014-06-17 09:35:45 +02:00
Norman Maurer	066f95d047	[#2573 ] UnpooledUnsafeDirectByteBuf.setBytes(int,ByteBuf,int,int) fails to use fast-path when src has array Motivation: UnpooledUnsafeDirectByteBuf.setBytes(int,ByteBuf,int,int) fails to use fast-path when src uses an array as backing storage. This is because the if else uses the wrong ByteBuf for its check. Modifications: - Use correct ByteBuf when check for array as backing storage - Also eliminate unecessary check in UnpooledDirectByteBuf which always fails anyway Result: Faster setBytes(...) when src ByteBuf is backed by an array. No more IndexOutOfBoundsException or data-corruption.	2014-06-16 11:11:41 +02:00
belliottsmith	2a2a21ec59	Introduce FastThreadLocal which uses an EnumMap and a predefined fixed set of possible thread locals Motivation: Provide a faster ThreadLocal implementation Modification: Add a "FastThreadLocal" which uses an EnumMap and a predefined fixed set of possible thread locals (all of the static instances created by netty) that is around 10-20% faster than standard ThreadLocal in my benchmarks (and can be seen having an effect in the direct PooledByteBufAllocator benchmark that uses the DEFAULT ByteBufAllocator which uses this FastThreadLocal, as opposed to normal instantiations that do not, and in the new RecyclableArrayList benchmark); Result: Improved performance	2014-06-13 10:56:18 +02:00
Norman Maurer	61dbc353ca	[#2436 ] UnsafeByteBuf implementation should only invert bytes if ByteOrder differ from native ByteOrder Motivation: Our UnsafeByteBuf implementation always invert bytes when the native ByteOrder is LITTLE_ENDIAN (this is true on intel), even when the user calls order(ByteOrder.LITTLE_ENDIAN). This is not optimal for performance reasons as the user should be able to set the ByteOrder to LITTLE_ENDIAN and so write bytes without the extra inverting. Modification: - Introduce a new special SwappedByteBuf (called UnsafeDirectSwappedByteBuf) that is used by all the Unsafe*ByteBuf implementation and allows to write without inverting the bytes. - Add benchmark - Upgrade jmh to 0.8 Result: The user is be able to get the max performance even on servers that have ByteOrder.LITTLE_ENDIAN as their native ByteOrder.	2014-06-05 10:59:22 +02:00
Trustin Lee	7d9374a582	Use Java 5 foreach for arrays for brevity at no cost	2014-06-02 18:25:25 +09:00
Trustin Lee	af4c30fa56	Remove the deprecated constructor	2014-06-02 18:24:19 +09:00
Trustin Lee	e79ca269b8	Introduce ThreadDeathWatcher Motivation: PooledByteBufAllocator's thread local cache and ReferenceCountUtil.releaseLater() are in need of a way to run an arbitrary logic when a certain thread is terminated. Modifications: - Add ThreadDeathWatcher, which spawns a low-priority daemon thread that watches a list of threads periodically (every second) and invokes the specified tasks when the associated threads are not alive anymore - Start-stop logic based on CAS operation proposed by @tea-dragon - Add debug-level log messages to see if ThreadDeathWatcher works Result: - Fixes #2519 because we don't use GlobalEventExecutor anymore - Cleaner code	2014-06-02 18:23:23 +09:00
Trustin Lee	ea3dac0753	Do not use a pseudo random for tree traversal Motivation: If we make allocateRun/SubpageSimple() always try the left node first and make allocateRun/Subpage() always tries the right node first, it is more likely that allocateRun/Subpage() will find a node with ST_UNUSED sooner. Modifications: - Make allocateRunSimple() and allocateSubpageSimple() always try the left node first. - Make allocateRun() and allocateSubpage() always try the right node first. - Remove randome Result: We get the same performance without using random numbers.	2014-05-30 11:24:16 +09:00
Trustin Lee	e5ed69241b	Optimize PooledByteBufAllocator Motivation: We still have a room for improvement in PoolChunk.allocateRun() and Subpage.allocate(). Modifications: - Unroll the recursion in PoolChunk.allocateRun() - Subpage.allocate() makes use of the 'nextAvail' value set by previous free(). Result: - PoolChunk.allocateRun() optimization yields 10%+ improvements in allocation throughput for non-subpage allocations. - Subpage.allocate() optimization makes the subpage allocations for tiny buffers as fast as non-tiny buffers even when the pageSize is huge (e.g. 1048576) because it doesn't need to perform a linear search in most cases.	2014-05-30 10:51:21 +09:00
Jake Luciani	d547b5d51d	Fix capacity check bug affecting offheap buffers	2014-05-13 07:25:15 +02:00
Trustin Lee	db3709e652	Synchronized between 4.1 and master Motivation: 4 and 5 were diverged long time ago and we recently reverted some of the early commits in master. We must make sure 4.1 and master are not very different now. Modification: Fix found differences Result: 4.1 and master got closer.	2014-04-25 00:38:02 +09:00
ian	15d11289b0	Fix error that causes (up to) double memory usage Motivation: PoolArena's 'normalizeCapacity' function was micro-optimized some time ago to remove a while loop. However, there was a change of behavior in the function as a result. Capacities passed into it that are already powers of 2 (and >= 512) are doubled in size. So if I ask for a buffer with a capacity of 1024, I will get back one that actually uses 2048 bytes (stored in maxLength). Aligning to powers of two for book keeping ease is reasonable, and if someone tries to expand a buffer, you might as well use some of the previously wasted space. However, since this distinction between 'easily expanded' and 'costly to expand' space is not supported at all by the APIs, I cannot imagine this change to doubling is desirable or intentional. This is especially costly when using composite buffers. They frequently allocate components with a capacity that is a power of 2, and they never attempt to expand components themselves. The end result is that heavy use of pool-backed composite buffers wastes almost half of the memory pool (the smaller / initial components are <512 and so are not affected by the off-by-one bug). Modifications: Although I find it difficult to believe that such an optimization is really helpful, I left it in and fixed the off-by-one issue by decrementing the value at the start. I also added a simple test to both attempt to verify that the decrement fixes the issue without introducing any other change, and to make it easy for a reviewer to test the existing behavior. PoolArena does not seem to have much testing or testability support though so the test is kind of a hack and will break for unrelated changes. I suggest either removing it or factoring out the single non-static portion of normalizeCapacity so that the fragile dummy PoolArena is not required. Result: Pooled allocators will allocate less resources to the highly inefficient and undocumented buffer section between length and maxLength. Composite buffers of non-trivial size that are backed by pooled allocators will use about half as much memory.	2014-04-15 07:03:13 +02:00
Norman Maurer	ceffa82d0d	[#2370 ] Periodically check for not alive Threads and free up their ThreadPoolCache Motivation: At the moment we create new ThreadPoolCache whenever a Thread tries either allocate or release something on the PooledByteBufAllocator. When something is released we put it then in its ThreadPoolCache. The problem is we never check if a Thread is not alive anymore and so we may end up with memory that is never freed again if a user create many short living Threads that use the PooledByteBufAllocator. Modifications: Periodically check if the Thread is still alive that has a ThreadPoolCache assinged and if not free it. Result: Memory is freed up correctly even for short living Threads.	2014-04-09 11:45:11 +02:00
Norman Maurer	8429ecfcc4	Implement Thread caches for pooled buffers to minimize conditions. This fixes [#2264 ] and [#808 ]. Motivation: Remove the synchronization bottleneck in PoolArena and so speed up things Modifications: This implementation uses kind of the same technics as outlined in the jemalloc paper and jemalloc blogpost https://www.facebook.com/notes/facebook-engineering/scalable-memory-allocation-using-jemalloc/480222803919. At the moment we only cache for "known" Threads (that powers EventExecutors) and not for others to keep the overhead minimal when need to free up unused buffers in the cache and free up cached buffers once the Thread completes. Here we use multi-level caches for tiny, small and normal allocations. Huge allocations are not cached at all to keep the memory usage at a sane level. All the different cache configurations can be adjusted via system properties or the constructor directly where it makes sense. Result: Less conditions as most allocations can be served by the cache itself	2014-03-20 09:30:57 -07:00
Trustin Lee	19422972e3	Fix and simplify freeing a direct buffer / Fix Android support Motivation: `6e8ba291cf` introduced a regression in Android because Android does not have sun.nio.ch.DirectBuffer (see #2330.) I also found PlatformDependent0.freeDirectBuffer() and freeDirectBufferUnsafe() are pretty much same after the commit and the unsafe version should be removed. Modifications: - Do not use the pooled allocator in Android because it's too resource hungry for Androids. - Merge PlatformDependent0.freeDirectBuffer() and freeDirectBufferUnsafe() into one method. - Make the Unsafe unavailable when sun.nio.ch.DirectBuffer is unavailable. We could keep the Unsafe available and handle the sun.nio.ch.DirectBuffer case separately, but I don't want to complicate our code just because of that. All supported JDK versions have sun.nio.ch.DirectBuffer if the Unsafe is available. Result: Simpler code. Fixes Android support (#2330)	2014-03-20 11:11:07 +09:00
Jakob Buchgraber	1bce46dbb3	Bit tricks to check for and calculate power of two. Motivation: I was studying the code and thought this was simpler and easier to understand. Modifications: Replaced the for loop and if conditions, with a simple implementation. Result: Code is easier to understand.	2014-03-18 15:59:34 +09:00
Bourne, Geoff	1334d34e9d	Fix limit computation of NIO ByteBuffers obtained via ReadOnlyByteBufferBuf.nioBuffer Motivation: When starting with a read-only NIO buffer, wrapping it in a ByteBuf, and then later retrieving a re-wrapped NIO buffer the limit was getting too short. Modifications: Changed ReadOnlyByteBufferBuf.nioBuffer(int,int) to compute the limit in the same manner as the internalNioBuffer method. Result: Round-trip conversion from NIO to ByteBuf to NIO will work reliably.	2014-03-14 08:10:18 +01:00
Trustin Lee	fbd4385506	Determine the default allocator from system property - Add ByteBufAllocator.DEFAULT - The default allocator is now 'pooled'	2014-02-14 13:04:12 -08:00
Trustin Lee	dea5c688fd	Fix checkstyle	2014-02-13 18:51:32 -08:00
Trustin Lee	51349352e2	Fix a bug that CompositeByteBuf.touch() does nothing	2014-02-13 18:24:36 -08:00
Trustin Lee	8837afddf8	Enable a user specify an arbitrary information with ReferenceCounted.touch() - Related: #2163 - Add ResourceLeakHint to allow a user to provide a meaningful information about the leak when touching it - DefaultChannelHandlerContext now implements ResourceLeakHint to tell where the message is going. - Cleaner resource leak report by excluding noisy stack trace elements	2014-02-13 18:16:25 -08:00
Trustin Lee	45e70d9935	Add ReferenceCounted.touch() / Add missing retain() overrides - Fixes #2163 - Inspector warnings	2014-02-13 18:10:11 -08:00
Trustin Lee	2b84314fdd	Add Recycler.Handle.recycle() so that it's possible to recycle an object without an explicit reference to Recycler	2014-02-13 17:24:37 -08:00
Trustin Lee	26fc84529f	Also record retain() and release()	2014-02-13 16:58:45 -08:00
Norman Maurer	2351b8ddd9	Add FixedCompositeByteBuf which can be used to write an array of ByteBuf in an efficient way. This implementation does not produce as much GC pressure as CompositeByteBuf and so is prefered, for writing an array of ByteBufs. Be aware that FixedCompositeByteBuf is readonly. When using this in a project that make heavy use of CompositeByteBuf for writes we was able to cut down allocation to a half.	2014-02-13 16:52:31 -08:00
Trustin Lee	40003ed250	Resurrect Channel.id() with global uniqueness - Fixes #1810 - Add a new interface ChannelId and its default implementation which generates globally unique channel ID. - Replace AbstractChannel.hashCode with ChannelId.hashCode() and ChannelId.shortValue() - Add variants of ByteBuf.hexDump() which accept byte[] instead of ByteBuf.	2014-02-13 15:53:07 -08:00
Trustin Lee	ef4bc99849	Remove unnecessary parenthesis	2014-02-13 15:52:06 -08:00
Norman Maurer	f23d68b42f	[#2187 ] Always do a volatile read on the refCnt	2014-02-07 09:23:16 +01:00
Norman Maurer	9bee78f91c	Provide an optimized AtomicIntegerFieldUpdater, AtomicLongFieldUpdater and AtomicReferenceFieldUpdater	2014-02-06 20:08:45 +01:00
Trustin Lee	2102cb062b	Fix false-positive leaks - All derived buffers and swapped buffers of a leak-aware buffer must be wrapped again with the leak-aware buffer	2013-12-06 21:32:56 +09:00
Trustin Lee	128c4b96b5	Checkstyle	2013-12-06 13:54:36 +09:00
Trustin Lee	5d39b1fc3d	Also record retain() and release()	2013-12-06 13:45:24 +09:00
Trustin Lee	e88172495a	Ensure backward compatibility .. by resurrecting the removed methods and system properties.	2013-12-05 01:02:38 +09:00
Trustin Lee	65b522a2a7	Better buffer leak reporting - Remove the reference to ResourceLeak from the buffer implementations and use wrappers instead: - SimpleLeakAwareByteBuf and AdvancedLeakAwareByteBuf - It is now allocator's responsibility to create a leak-aware buffer. - Added AbstractByteBufAllocator.toLeakAwareBuffer() for easier implementation - Add WrappedByteBuf to reduce duplication between *LeakAwareByteBuf and UnreleasableByteBuf - Raise the level of leak reports to ERROR - because it will break the app eventually - Replace enabled/disabled property with the leak detection level - Only print stack trace when level is ADVANCED or above to avoid user confusion - Add the 'leak' build profile, which enables highly detailed leak reporting during the build - Remove ResourceLeakException which is unsed anymore	2013-12-05 00:51:39 +09:00
Norman Maurer	053c512f6d	Fix checkstyle	2013-12-02 08:23:57 +01:00
Norman Maurer	14600167d6	[#2021 ] No need to synchronize for unpooled chunks	2013-12-02 08:02:48 +01:00
Norman Maurer	7231be592a	Also allow to override how direct ByteBuffers are freed	2013-11-12 12:40:41 +01:00
Norman Maurer	e83fb821d5	Allow to override how wrapped direct ByteBuffer are allocated to make it easier to extend	2013-11-12 12:13:38 +01:00
Norman Maurer	b00f8c6390	[#1976 ] Fix IndexOutOfBoundsException when calling CompositeByteBuf.discardReadComponents()	2013-11-09 20:13:24 +01:00
Alex Petrov	e4f391f626	Improve docstrings for and of	2013-11-08 12:15:41 +01:00
Trustin Lee	ba3bc0c020	Simpler toString() for ByteBufAllocators	2013-11-08 17:54:34 +09:00
Norman Maurer	77b4ec7e1b	[#1800 ] [#1802 ] Correctly expand capacity of ByteBuf while preserve content	2013-11-04 15:18:21 +01:00
Trustin Lee	54db9ec725	Use StringUtil.simpleClassName(..) instead of Class.getSimpleName() where necessary - Class.getSimpleName() doesn't render anonymous classes very well - + some minor cleanup	2013-11-04 19:46:15 +09:00
Norman Maurer	4ce49a6195	[#1943 ] Unpooled.copiedBuffer(ByteBuf pooled) should always return unpooled ByteBuf	2013-10-22 20:20:38 +02:00
Norman Maurer	68b616728a	[#1925 ] Only expose sub-region of ByteBuf on nioBuffer(...)	2013-10-16 10:34:33 +02:00
Norman Maurer	d946659520	[#1906 ] Use a ByteBuf allocator from the ByteBufAllocator when encode Strings	2013-10-09 21:18:08 +02:00
Norman Maurer	1c73be21fc	Remove redundant index check	2013-10-08 07:21:01 +02:00
Norman Maurer	ee192f0321	[#1880 ] Use ByteBufAllocator when read bytes into new chunks	2013-10-01 10:10:43 +02:00
Norman Maurer	6d09e57be7	[#1875 ] Correctly check the readerIndex when try to read a byte from AbstractByteBuf	2013-09-30 14:47:49 +02:00
Norman Maurer	2b9a07cac9	CompositeByteBuf.isDirect() should return true if its only backed by direct buffers	2013-09-26 20:37:31 +02:00
Norman Maurer	a74149e984	[#1865 ] Only use internalNioBuffer when one of the read* or write* methods are used. This is neccessary to prevent races as those can happen when a slice or duplicate is shared between different Channels that are not assigned to the same EventLoop. In general get* operations should always be safe to be used from different Threads. This aslo include unit tests that show the issue	2013-09-25 17:27:26 +02:00
Norman Maurer	910ed31a1b	[#1851 ] EmptyByteBuf.isWritable(..) and isReadable(...) should not throw IndexOutOfBoundsException	2013-09-21 20:40:22 +02:00
Norman Maurer	23baef8fb4	[#1853 ] Optimize gathering writes for CompositeByteBuf that are only backed by one ByteBuffer	2013-09-19 07:29:58 +02:00
Norman Maurer	c0bbde48b7	[#1852 ] Fix bug in UnpooledDirectByteBuf.nioBuffer(...) implementation	2013-09-18 20:47:57 +02:00
Greg Soltis	f1d4f813ed	Fix nioBuffer implementation for CompositeByteBuf	2013-09-16 06:41:08 +02:00
Norman Maurer	451e91d142	[#1821 ] Fix IndexOutOfBoundsException which was thrown if the last component was removed but other components was left	2013-09-09 20:29:30 +02:00
Norman Maurer	25c226a835	Make sure only direct ByteBuffer are passed to the underlying jdk Channel. This is needed because of otherwise the JDK itself will do an extra ByteBuffer copy with it's own pool implementation. Even worth it will be done multiple times if the ByteBuffer is always only partial written. With this change the copy is done inside of netty using it's own allocator and only be done one time in all cases.	2013-09-02 20:17:53 +02:00
Norman Maurer	5416f2315e	[#1797 ] No use internalNioBuffer() in derived buffers as it is not meant for concurrent access	2013-09-02 14:15:19 +02:00
Norman Maurer	795182843d	Remove legancy code which we not need anymore as we use gathering writes anyway everywhere	2013-09-01 11:00:58 +02:00
Norman Maurer	5ddd7cee90	[#1797 ] Throw IllegalArgumentException if AbstractByteBuf.skipBytes(...) is used with a negative value	2013-08-29 11:14:36 +02:00
Trustin Lee	20894bc99e	Fix a bug in internalNioBuffer() implementations of derived buffers - A user can create multiple duplicates of a buffer and access their internal NIO buffers. (e.g. write multiple duplicates to multiple channels assigned to different event loop.) Because the derived buffers' internalNioBuffer() simply delegates the call to the original buffer, all derived buffers and the original buffer's internalNioBuffer() will return the same buffer, which will lead to a race condition. - Fixes #1739	2013-08-20 14:28:50 +09:00
bgallagher	9f88552f12	remove some dead code	2013-08-10 20:46:48 +02:00
Norman Maurer	194b64cff1	[#1708 ] Correctly set the writerIndex in ReadOnlyByteBufferBuf if it is constructed with a buffer which has non zero position	2013-08-08 06:54:32 +02:00
Norman Maurer	ea1dca8105	[#1704 ] Make sure SwappedByteBuf.readSlice(..) returns ByteBuf with correct ByteOrder	2013-08-06 12:22:22 +02:00
Norman Maurer	8a673db92b	[#1644 ] Fixed IndexOutOfBoundException when calling copy() on a empty CompositeByteBuf	2013-07-24 07:35:51 +02:00
Trustin Lee	764741c5ce	Change the contract of ResourceLeakDetector.open() so that unsampled resources are recycled - This also fixes the problem introduced while trying to implement #1612 (Allow to disable resource leak detection).	2013-07-23 14:06:58 +09:00
Norman Maurer	35802207e1	Fix compile error	2013-07-23 06:42:41 +02:00
kerr	ada07cb9e0	Fix types in javadocs	2013-07-22 19:14:36 +02:00
Norman Maurer	f478fcd3b0	[#1628 ] Fix bug in ReadOnlyByteByteBufferBuf where get operations threw ReadOnlyBufferException	2013-07-22 07:12:05 +02:00

... 2 3 4 5 6 ...

573 Commits