netty5

Author	SHA1	Message	Date
Norman Maurer	45e9fd4802	Revert "Provide a way to cache the internal nioBuffer of the PooledByteBuffer to reduce GC. (#8593 )" This reverts commit `945f123f8b` as it seems to produce some failures in some cases. This needs more research.	2018-11-27 20:03:33 +01:00
Norman Maurer	945f123f8b	Provide a way to cache the internal nioBuffer of the PooledByteBuffer to reduce GC. (#8593 ) Motivation: Often a temporary ByteBuffer is used which can be cached to reduce the GC pressure. Modifications: Add a Deque per PoolChunk which will be used for caching. Result: Less GC.	2018-11-27 13:55:24 +01:00
Rolandz	836c39b82f	Fix offset calculation in PooledByteBufAllocator when used Motivation: When we create new chunk with memory aligned, the offset of direct memory should be 'alignment - address & (alignment - 1)', not just 'address & (alignment - 1)'. Modification: Change offset calculating formula to offset = alignment - address & (alignment - 1) in PoolArena.DirectArena#offsetCacheLine and add a unit test to assert that. Result: Correctly calculate offset.	2018-11-27 11:47:40 +01:00
Nick Hill	804e1fa9cc	Fix ref-counting when CompositeByteBuf is used with retainedSlice() (#8497 ) Motivation: ByteBuf.retainedSlice() and similar methods produce sliced buffers with an independent refcount to the buffer that they wrap. One of the optimizations in `10539f4dc7` was to use the ref to the unwrapped buffer object for added slices, but this did not take into account the above special case when later releasing. Thanks to @rkapsi for discovering this via #8495. Modifications: Since a reference to the slice is still kept in the Component class, just changed Component.freeIfNecessary() to release the slice in preference to the unwrapped buf. Also added a unit test which reproduces the bug. Result: Fixes #8495	2018-11-13 20:56:09 +01:00
Nick Hill	0f8ce1b284	Fix incorrect sizing of temp byte arrays in (Unsafe)ByteBufUtil (#8484 ) Motivation: Two similar bugs were introduced by myself in separate recent PRs #8393 and #8464, while optimizing the assignment/handling of temporary arrays in ByteBufUtil and UnsafeByteBufUtil. The temp arrays allocated for buffering data written to an OutputStream are incorrectly sized to the full length of the data to copy rather than being capped at WRITE_CHUNK_SIZE. Unfortunately one of these is in the 4.1.31.Final release, I'm really sorry and will be more careful in future. This kind of thing is tricky to cover in unit tests. Modifications: Revert the temp array allocations back to their original sizes. Avoid making duplicate calls to ByteBuf.capacity() in a couple of places in ByteBufUtil (unrelated thing I noticed, can remove it from this PR if desired!) Result: Temporary byte arrays will be reverted to their originally intended sizes.	2018-11-09 18:24:38 +01:00
Nick Hill	5954110b9a	Use ByteBufUtil.BYTE_ARRAYS ThreadLocal temporary arrays in more places (#8464 ) Motivation: #8388 introduced a reusable ThreadLocal<byte[]> for use in decodeString(...). It can be used in more places in the buffer package to avoid temporary allocations of small arrays. Modifications: Encapsulate use of the ThreadLocal in a static package-private ByteBufUtil.threadLocalTempArray(int) method, and make use of it from a handful of new places including ByteBufUtil.readBytes(...). Result: Fewer short-lived small byte array allocations.	2018-11-05 21:11:28 +01:00
Nick Hill	10539f4dc7	Streamline CompositeByteBuf internals (#8437 ) Motivation: CompositeByteBuf is a powerful and versatile abstraction, allowing for manipulation of large data without copying bytes. There is still a non-negligible cost to reading/writing however relative to "singular" ByteBufs, and this can be mostly eliminated with some rework of the internals. My use case is message modification/transformation while zero-copy proxying. For example replacing a string within a large message with one of a different length Modifications: - No longer slice added buffers and unwrap added slices - Components store target buf offset relative to position in composite buf - Less allocations, object footprint, pointer indirection, offset arithmetic - Use Component[] rather than ArrayList<Component> - Avoid pointer indirection and duplicate bounds check, more efficient backing array growth - Facilitates optimization when doing bulk-inserts - inserting n ByteBufs behind m is now O(m + n) instead of O(mn) - Avoid unnecessary casting and method call indirection via superclass - Eliminate some duplicate range/ref checks via non-checking versions of toComponentIndex and findComponent - Add simple fast-path for toComponentIndex(0); add racy cache of last-accessed Component to findComponent(int) - Override forEachByte0(...) and forEachByteDesc0(...) methods - Make use of RecyclableArrayList in nioBuffers(int, int) (in line with FasterCompositeByteBuf impl) - Modify addComponents0(boolean,int,Iterable) to use the Iterable directly rather than copy to an array first (and possibly to an ArrayList before that) - Optimize addComponents0(boolean,int,ByteBuf[],int) to not perform repeated array insertions and avoid second loop for offset updates - Simplify other logic in various places, in particular the general pattern used where a sub-range is iterated over - Add benchmarks to demonstrate some improvements While refactoring I also came across a couple of clear bugs. They are fixed in these changes but I will open another PR with unit tests and fixes to the current version. Result: Much faster creation, manipulation, and access; many fewer allocations and smaller footprint. Benchmark results to follow.	2018-11-03 10:37:07 +01:00
Nick Hill	44cca1a26f	Avoid allocations when wrapping byte[] and ByteBuffer arrays as ByteBuf (#8420 ) Motivation: Unpooled.wrap(byte[]...) and Unpooled.wrap(ByteBuffer...) currently allocate/copy an intermediate ByteBuf ArrayList and array, which can be avoided. Modifications: - Define new internal ByteWrapper interface and add a CompositeByteBuf constructor which takes a ByteWrapper with an array of the type that it wraps, and modify the appropriate Unpooled.wrap(...) methods to take advantage of it - Tidy up other constructors in CompositeByteBuf to remove duplication and misleading len arg (which is really an end offset into provided array) Result: Less allocation/copying when wrapping byte[] and ByteBuffer arrays, tidier code.	2018-10-30 19:35:39 +01:00
Nick Hill	48c45cf4ac	Fix leak and corruption bugs in CompositeByteBuf (#8438 ) Motivation: I came across two bugs: - Components removed due to capacity reduction aren't released - Offsets aren't set correctly on empty components that are added between existing components Modifications: Add unit tests which expose these bugs, fix them. Result: Bugs are fixed	2018-10-28 10:28:18 +01:00
Nick Hill	d7fa7be67f	Exploit PlatformDependent.allocateUninitializedArray() in more places (#8393 ) Motivation: There are currently many more places where this could be used which were possibly not considered when the method was added. If https://github.com/netty/netty/pull/8388 is included in its current form, a number of these places could additionally make use of the same BYTE_ARRAYS threadlocal. There's also a couple of adjacent places where an optimistically-pooled heap buffer is used for temp byte storage which could use the threadlocal too in preference to allocating a temp heap bytebuf wrapper. For example https://github.com/netty/netty/blob/4.1/buffer/src/main/java/io/netty/buffer/ByteBufUtil.java#L1417. Modifications: Replace new byte[] with PlatformDependent.allocateUninitializedArray() where appropriate; make use of ByteBufUtil.getBytes() in some places which currently perform the equivalent logic, including avoiding copy of backing array if possible (although would be rare). Result: Further potential speed-up with java9+ and appropriate compile flags. Many of these places could be on latency-sensitive code paths.	2018-10-27 10:43:28 -05:00
Nick Hill	583d838f7c	Optimize AbstractByteBuf.getCharSequence() in US_ASCII case (#8392 ) * Optimize AbstractByteBuf.getCharSequence() in US_ASCII case Motivation: Inspired by https://github.com/netty/netty/pull/8388, I noticed this simple optimization to avoid char[] allocation (also suggested in a TODO here). Modifications: Return an AsciiString from AbstractByteBuf.getCharSequence() if requested charset is US_ASCII or ISO_8859_1 (latter thanks to @Scottmitch's suggestion). Also tweak unit tests not to require Strings and include a new benchmark to demonstrate the speedup. Result: Speed-up of AbstractByteBuf.getCharSequence() in ascii and iso 8859/1 cases	2018-10-26 15:32:38 -07:00
Norman Maurer	87ec2f882a	Reduce overhead by ByteBufUtil.decodeString(...) which is used by `AbstractByteBuf.toString(...)` and `AbstractByteBuf.getCharSequence(...)` (#8388 ) Motivation: Our current implementation that is used for toString(Charset) operations on AbstractByteBuf implementation is quite slow as it does a lot of uncessary memory copies. We should just use new String(...) as it has a lot of optimizations to handle these cases. Modifications: Rewrite ByteBufUtil.decodeString(...) to use new String(...) Result: Less overhead for toString(Charset) operations. Benchmark (charsetName) (direct) (size) Mode Cnt Score Error Units ByteBufUtilDecodeStringBenchmark.decodeString US-ASCII false 8 thrpt 20 22401645.093 ? 4671452.479 ops/s ByteBufUtilDecodeStringBenchmark.decodeString US-ASCII false 64 thrpt 20 23678483.384 ? 3749164.446 ops/s ByteBufUtilDecodeStringBenchmark.decodeString US-ASCII true 8 thrpt 20 15731142.651 ? 3782931.591 ops/s ByteBufUtilDecodeStringBenchmark.decodeString US-ASCII true 64 thrpt 20 16244232.229 ? 1886259.658 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-8 false 8 thrpt 20 25983680.959 ? 5045782.289 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-8 false 64 thrpt 20 26235589.339 ? 2867004.950 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-8 true 8 thrpt 20 18499027.808 ? 4784684.268 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-8 true 64 thrpt 20 16825286.141 ? 1008712.342 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-16 false 8 thrpt 20 5789879.092 ? 1201786.359 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-16 false 64 thrpt 20 2173243.225 ? 417809.341 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-16 true 8 thrpt 20 5035583.011 ? 1001978.854 ops/s ByteBufUtilDecodeStringBenchmark.decodeString UTF-16 true 64 thrpt 20 2162345.301 ? 402410.408 ops/s ByteBufUtilDecodeStringBenchmark.decodeString ISO-8859-1 false 8 thrpt 20 30039052.376 ? 6539111.622 ops/s ByteBufUtilDecodeStringBenchmark.decodeString ISO-8859-1 false 64 thrpt 20 31414163.515 ? 2096710.526 ops/s ByteBufUtilDecodeStringBenchmark.decodeString ISO-8859-1 true 8 thrpt 20 19538587.855 ? 4639115.572 ops/s ByteBufUtilDecodeStringBenchmark.decodeString ISO-8859-1 true 64 thrpt 20 19467839.722 ? 1672687.213 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld US-ASCII false 8 thrpt 20 10787326.745 ? 1034197.864 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld US-ASCII false 64 thrpt 20 7129801.930 ? 1363019.209 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld US-ASCII true 8 thrpt 20 9002529.605 ? 2017642.445 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld US-ASCII true 64 thrpt 20 3860192.352 ? 826218.738 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-8 false 8 thrpt 20 10532838.027 ? 2151743.968 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-8 false 64 thrpt 20 7185554.597 ? 1387685.785 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-8 true 8 thrpt 20 7352253.316 ? 1333823.850 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-8 true 64 thrpt 20 2825578.707 ? 349701.156 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-16 false 8 thrpt 20 7277446.665 ? 1447034.346 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-16 false 64 thrpt 20 2445929.579 ? 562816.641 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-16 true 8 thrpt 20 6201174.401 ? 1236137.786 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld UTF-16 true 64 thrpt 20 2310674.973 ? 525587.959 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld ISO-8859-1 false 8 thrpt 20 11142625.392 ? 1680556.468 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld ISO-8859-1 false 64 thrpt 20 8127116.405 ? 1128513.860 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld ISO-8859-1 true 8 thrpt 20 9405751.952 ? 2193324.806 ops/s ByteBufUtilDecodeStringBenchmark.decodeStringOld ISO-8859-1 true 64 thrpt 20 3943282.076 ? 737798.070 ops/s Benchmark result is saved to /home/norman/mainframer/netty/microbench/target/reports/performance/ByteBufUtilDecodeStringBenchmark.json Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 1,030.173 sec - in io.netty.buffer.ByteBufUtilDecodeStringBenchmark [1030.460s][info ][gc,heap,exit ] Heap [1030.460s][info ][gc,heap,exit ] garbage-first heap total 516096K, used 257918K [0x0000000609a00000, 0x0000000800000000) [1030.460s][info ][gc,heap,exit ] region size 2048K, 127 young (260096K), 2 survivors (4096K) [1030.460s][info ][gc,heap,exit ] Metaspace used 17123K, capacity 17438K, committed 17792K, reserved 1064960K [1030.460s][info ][gc,heap,exit ] class space used 1709K, capacity 1827K, committed 1920K, reserved 1048576K	2018-10-19 14:00:13 +02:00
Norman Maurer	69545aedc4	CompositeByteBuf.decompose(...) does not correctly slice content. (#8403 ) Motivation: CompositeByteBuf.decompose(...) did not correctly slice the content and so produced an incorrect representation of the data. Modifications: - Rewrote implementation to fix bug and also improved it to reduce GC - Add unit tests. Result: Fixes https://github.com/netty/netty/issues/8400.	2018-10-19 08:05:22 +02:00
Nick Hill	7062ceedb0	Simplify ByteBufInputStream.readLine() logic (#8380 ) Motivation: While looking at the nice optimization done in https://github.com/netty/netty/pull/8347 I couldn't help noticing the logic could be simplified further. Apologies if this is just my OCD and inappropriate! Modifications: Reduce amount of code used for ByteBufInputStream.readLine() Result: Slightly smaller and simpler code	2018-10-13 06:24:40 +02:00
Francesco Nigro	83dc3b503e	ByteBufInputStream is always allocating a StringBuilder instance (#8347 ) Motivation: Avoid creating any StringBuilder instance if ByteBufInputStream::readLine isn't used Modifications: The StringBuilder instance is lazy allocated on demand and are added new test case branches to address the increased complexity of ByteBufInputStream::readLine Result: Reduced GC activity if ByteBufInputStream::readLine isn't used	2018-10-11 14:56:29 +08:00
Dmitriy Dumanskiy	6cebb6069b	remove unnecessary vararg argument in PooledByteBufAllocator (#8338 ) Motivation: No need in varargs, the method always accepts array. Modification: ... replaced with []	2018-10-05 19:06:44 +08:00
Norman Maurer	c14efd952d	Directly init refCnt to 1 (#8274 ) Motivation: We should just directly init the refCnt to 1 and not use the AtomicIntegerFieldUpdater. Modifications: Just assing directly to 1. Result: Cleaner code and possible a bit faster as the JVM / JIT may be able to optimize the first store easily.	2018-09-07 19:04:19 +02:00
Norman Maurer	e542a2cf26	Use a non-volatile read for ensureAccessible() whenever possible to reduce overhead and allow better inlining. (#8266 ) Motiviation: At the moment whenever ensureAccessible() is called in our ByteBuf implementations (which is basically on each operation) we will do a volatile read. That per-se is not such a bad thing but the problem here is that it will also reduce the the optimizations that the compiler / jit can do. For example as these are volatile it can not eliminate multiple loads of it when inline the methods of ByteBuf which happens quite frequently because most of them a quite small and very hot. That is especially true for all the methods that act on primitives. It gets even worse as people often call a lot of these after each other in the same method or even use method chaining here. The idea of the change is basically just ue a non-volatile read for the ensureAccessible() check as its a best-effort implementation to detect acting on already released buffers anyway as even with a volatile read it could happen that the user will release it in another thread before we actual access the buffer after the reference check. Modifications: - Try to do a non-volatile read using sun.misc.Unsafe if we can use it. - Add a benchmark Result: Big performance win when multiple ByteBuf methods are called from a method. With the change: UnsafeByteBufBenchmark.setGetLongUnsafeByteBuf thrpt 20 281395842,128 ± 5050792,296 ops/s Before the change: UnsafeByteBufBenchmark.setGetLongUnsafeByteBuf thrpt 20 217419832,801 ± 5080579,030 ops/s	2018-09-07 07:47:02 +02:00
Francesco Nigro	c78be33443	Added configurable ByteBuf bounds checking (#7521 ) Motivation: The JVM isn't always able to hoist out/reduce bounds checking (due to ref counting operations etc etc) hence making it configurable could improve performances for most CPU intensive use cases. Modifications: Each AbstractByteBuf bounds check has been tested against a new static final configuration property similar to checkAccessible ie io.netty.buffer.bytebuf.checkBounds. Result: Any user could disable ByteBuf bounds checking in order to get extra performances.	2018-09-03 20:33:47 +02:00
Norman Maurer	54f565ac67	Allow to use native transports when sun.misc.Unsafe is not present on… (#8231 ) * Allow to use native transports when sun.misc.Unsafe is not present on the system Motivation: We should be able to use the native transports (epoll / kqueue) even when sun.misc.Unsafe is not present on the system. This is especially important as Java11 will be released soon and does not allow access to it by default. Modifications: - Correctly disable usage of sun.misc.Unsafe when -PnoUnsafe is used while running the build - Correctly increment metric when UnpooledDirectByteBuf is allocated. This was uncovered once -PnoUnsafe usage was fixed. - Implement fallbacks in all our native transport code for when sun.misc.Unsafe is not present. Result: Fixes https://github.com/netty/netty/issues/8229.	2018-08-29 19:36:33 +02:00
vincent-grosbois	0bea8ecf5d	CompositeByteBuf nioBuffer doesn't always alloc (#8176 ) In nioBuffer(int,int) in CompositeByteBuf , we create a sub-array of nioBuffers for the components that are in range, then concatenate all the components in range into a single bigger buffer. However, if the call to nioBuffers() returned only one sub-buffer, then we are copying it to a newly-allocated buffer "merged" for no reason. Motivation: Profiler for Spark shows a lot of time spent in put() method inside nioBuffer(), while usually no copy of data is required. Modification: This change skips this last step and just returns a duplicate of the single buffer returned by the call to nioBuffers(), which will in most implementation not copy the data Result: No copy when the source is only 1 buffer	2018-08-07 11:31:24 +02:00
Norman Maurer	9b08dbca00	Leak detection combined with composite buffers results in incorrectly handled writerIndex when calling ByteBufUtil.writeAscii/writeUtf8 (#8153 ) Motivation: We need to add special handling for WrappedCompositeByteBuf as these also extend AbstractByteBuf, otherwise we will not correctly adjust / read the writerIndex during processing. Modifications: - Add instanceof checks for WrappedCompositeByteBuf as well. - Add testcases Result: Fixes https://github.com/netty/netty/issues/8152.	2018-07-27 01:56:09 +08:00
Norman Maurer	6afab517b0	Guard against calling PoolThreadCache.free() multiple times. (#8108 ) Motivation: `5b1fe611a6` introduced the usage of a finalizer as last resort for PoolThreadCache. As we may call free() from the FastThreadLocal.onRemoval(...) and finalize() we need to guard against multiple calls as otherwise we will corrupt internal state (that is used for metrics). Modifications: Use AtomicBoolean to guard against multiple calls of PoolThreadCache.free(). Result: No more corruption of internal state caused by calling PoolThreadCache.free() multuple times.	2018-07-09 15:58:12 -04:00
Nick Hill	fef462c043	Deprecate Unpooled.unmodifiableBuffer(ByteBuf...) (#8096 ) Motivation: Recent PR https://github.com/netty/netty/pull/8040 introduced Unpooled.wrappedUnmodifiableBuffer(ByteBuf...) which has the same behaviour but wraps the provided array directly. This is preferred for most uses (including varargs-based use) and if there are any unusual cases of an explicit array which is re-used before the ByteBuf is finished with, it can just be copied first. Modifications: Added @Deprecated annotation and javadoc to Unpooled.unmodifiableBuffer(ByteBuf...). Result: Unpooled.unmodifiableBuffer(ByteBuf...) will be deprecated.	2018-07-07 14:45:27 -04:00
Norman Maurer	83710cb2e1	Replace toArray(new T[size]) with toArray(new T[0]) to eliminate zero-out and allow the VM to optimize. (#8075 ) Motivation: Using toArray(new T[0]) is usually the faster aproach these days. We should use it. See also https://shipilev.net/blog/2016/arrays-wisdom-ancients/#_conclusion. Modifications: Replace toArray(new T[size]) with toArray(new T[0]). Result: Faster code.	2018-06-29 07:56:04 +02:00
Norman Maurer	5b1fe611a6	Remove usage of ObjectCleaner (#8064 ) Motivation: ObjectCleaner does start a Thread to handle the cleaning of resources which leaks into the users application. We should not use it in netty itself to make things more predictable. Modifications: - Remove usage of ObjectCleaner and use finalize as a replacement when possible. - Clarify javadocs for FastThreadLocal.onRemoval(...) to ensure its clear that remove() is not guaranteed to be called when the Thread completees and so this method is not enough to guarantee cleanup for this case. Result: Fixes https://github.com/netty/netty/issues/8017.	2018-06-28 08:15:27 +02:00
nickhill	f164759ea3	Support composite buffer creation without array alloc and copy Motivation: Unpooled.unmodifiableBuffer() is currently used to efficiently write arrays of ByteBufs via FixedCompositeByteBuf, but involves an allocation and content-copy of the provided ByteBuf array which in many (most?) cases shouldn't be necessary. Modifications: Modify the internal FixedCompositeByteBuf class to support wrapping the provided ByteBuf array directly. Control this behaviour with a constructor flag and expose the "unsafe" version via a new Unpooled.wrappedUnmodifiableBuffer(ByteBuf...) method. Result: Less garbage on IO paths. I would guess pretty much all existing usage of unmodifiableBuffer() could use the copy-free version but assume it's not safe to change its default behaviour.	2018-06-27 07:40:14 +02:00
nickhill	9b95b8ee62	Reduce array allocations during CompositeByteBuf construction Motivation: Eliminate avoidable backing array reallocations when constructing composite ByteBufs from existing buffer arrays/Iterables. This also applies to the Unpooled.wrappedBuffer(...) methods. Modifications: Ensure the initial components ComponentList is sized at least as large as the provided buffer array/Iterable in the CompositeByteBuffer constructors. In single-arg Unpooled.wrappedBuffer(...) methods, set maxNumComponents to the count of provided buffers, rather than a fixed default of 16. It seems likely that most usage of these involves wrapping a list without subsequent modification, particularly since they return a ByteBuf rather than CompositeByteBuf. If a different/larger max is required there are already the wrappedBuffer(int, ...) variants. In fact the current behaviour could be considered inconsistent - if you call Unpooled.wrappedBuffer(int, ByteBuf) with a single buffer, you might expect to subsequently be able to add buffers to it (since you specified a max related to consolidation), but it will in fact return just a slice of the provided ByteBuf. Result: Fewer and smaller allocations in some cases when using CompositeByteBufs or Unpooled.wrappedBuffer(...).	2018-06-20 16:09:23 +02:00
Tim Brooks	35215309b9	Make UnpooledHeapByteBuf array methods protected (#8015 ) Motivation: Currently there is not a clear way to provide a byte array to a netty ByteBuf and be informed when it is released. This is a would be a valuable addition for projects that integrate with netty but also pool their own byte arrays. Modification: Modified the UnpooledHeapByteBuf class so that the freeArray method is protected visibility instead of default. This will allow a user to subclass the UnpooledHeapByteBuf, provide a byte array, and override freeArray to return the byte array to a pool when it is called. Additionally this makes this implementation equivalent to UnpooledDirectByteBuf (freeDirect is protected). Additionally allocateArray is also made protect to provide another override option for subclasses. Result: Users can override UnpooledHeapByteBuf#freeArray and UnpooledHeapByteBuf#allocateArray.	2018-06-13 11:43:31 -07:00
zekaryu	09d9daf1c4	Update the comment of io.netty.buffer.PoolChunk.java (#7934 ) Motivation: When I read the source code, I found that the comment of PoolChunk is out of date, it may confuses readers with the description about memoryMap. Modifications: update the last passage of the comment of the PoolChunk class. Result: No change to any source code , just update comment.	2018-05-14 15:44:32 +02:00
Xiaoyan Lin	a0ed6ec06c	Fix the error message in ReferenceCounted.release (#7921 ) Motivation: When a buffer is over-released, the current error message of `IllegalReferenceCountException` is `refCnt: XXX, increment: XXX`, which is confusing. The correct message should be `refCnt: XXX, decrement: XXX`. Modifications: Pass `-decrement` to create `IllegalReferenceCountException`. Result: The error message will be `refCnt: XXX, decrement: XXX` when a buffer is over-released.	2018-05-08 20:09:16 +02:00
Rikki Gibson	1b1f7677ac	Implement isWritable and ensureWritable on ReadOnlyByteBufferBuf (#7883 ) Motivation: It should be possible to write a ReadOnlyByteBufferBuf to a channel without errors. However, ReadOnlyByteBufferBuf does not override isWritable and ensureWritable, which can cause some handlers to mistakenly assume they can write to the ReadOnlyByteBufferBuf, resulting in ReadOnlyBufferException. Modification: Added isWritable and ensureWritable method overrides on ReadOnlyByteBufferBuf to indicate that it is never writable. Added tests for these methods. Result: Can successfully write ReadOnlyByteBufferBuf to a channel with an SslHandler (or any other handler which may attempt to write to the ByteBuf it receives).	2018-04-23 08:31:30 +02:00
Nikolay Fedorovskikh	81a7d1413b	Makes `EmptyByteBuf#hashCode` and `AbstractByteBuf#hashCode` consistent (#7870 ) Motivation: The `AbstractByteBuf#equals` method doesn't take into account the class of buffer instance. So the two buffers with different classes must have the same `hashCode` values if `equals` method returns `true`. But `EmptyByteBuf#hashCode` is not consistent with `#hashCode` of the empty `AbstractByteBuf`, that is violates the contract and can lead to errors. Modifications: Return `1` in `EmptyByteBuf#hashCode`. Result: Consistent behavior of `EmptyByteBuf#hashCode` and `AbstractByteBuf#hashCode`.	2018-04-16 12:11:42 +02:00
Nikolay Fedorovskikh	f8ff834f03	Checks accessibility in the #slice and #duplicate methods of ByteBuf (#7846 ) Motivation: The `ByteBuf#slice` and `ByteBuf#duplicate` methods should check an accessibility to prevent creation slice or duplicate of released buffer. At now this works not in the all scenarios. Modifications: Add missed checks. Result: More correct and consistent behavior of `ByteBuf` methods.	2018-04-10 10:41:50 +02:00
Nikolay Fedorovskikh	a95fd91bc6	Don't check accessible in the #capacity method (#7830 ) Motivation: The `#ensureAccessible` method in `UnpooledHeapByteBuf#capacity` used to prevent NPE if buffer is released and `array` is `null`. In all other implementations of `ByteBuf` the accessible is not checked by `capacity` method. We can assign an empty array to `array` in the `deallocate` and don't worry about NPE in the `#capacity`. This will help reduce the number of repeated calls of the `#ensureAccessible` in many operations with `UnpooledHeapByteBuf`. Modifications: 1. Remove `#ensureAccessible` call from `UnpooledHeapByteBuf#capacity`. Use the `EmptyArrays#EMPTY_BYTES` instead of `null` in `#deallocate`. 2. Fix access checks in `AbstractUnsafeSwappedByteBuf` and `AbstractByteBuf#slice` that relied on `#ensureAccessible` in `UnpooledHeapByteBuf#capacity`. This was found by unit tests. Result: Less double calls of `#ensureAccessible` for `UnpooledHeapByteBuf`.	2018-04-03 21:35:02 +02:00
Norman Maurer	965734a1eb	Limit the number of bytes to use to copy the content of a direct buffer to an Outputstream (#7813 ) Motivation: Currently copying a direct ByteBuf copies it fully into the heap before writing it to an output stream. The can result in huge memory usage on the heap. Modification: copy the bytebuf contents via an 8k buffer into the output stream Result: Fixes #7804	2018-03-29 12:49:27 +02:00
Norman Maurer	bd772d127e	FixedCompositeByteBuf should allow to access memoryAddress / array when wrap a single buffer. Motivation: We should allow to access the memoryAddress / array of the FixedCompositeByteBuf when it only wraps a single ByteBuf. We do the same for CompositeByteBuf. Modifications: - Check how many buffers FixedCompositeByteBuf wraps and depending on it delegate the access to the memoryAddress / array - Add unit tests. Result: Fixes [#7752].	2018-03-13 08:50:42 +01:00
kakashiio	12ccd40c5a	Correctly throw IndexOutOfBoundsException when writerIndex < readerIndex Motivation: If someone invoke writeByte(), markWriterIndex(), readByte() in order first, and then invoke resetWriterIndex() should be throw a IndexOutOfBoundsException to obey the rule that the buffer declared "0 <= readerIndex <= writerIndex <= capacity". Modification: Changed the code writerIndex = markedWriterIndex; into writerIndex(markedWriterIndex); to make the check affect Result: Throw IndexOutOfBoundsException if any invalid happened in resetWriterIndex.	2018-03-02 10:05:33 +09:00
Francesco Nigro	ed46c4ed00	Copies from read-only heap ByteBuffer to direct ByteBuf can avoid stealth ByteBuf allocation and additional copies Motivation: Read-only heap ByteBuffer doesn't expose array: the existent method to perform copies to direct ByteBuf involves the creation of a (maybe pooled) additional heap ByteBuf instance and copy Modifications: To avoid stressing the allocator with additional (and stealth) heap ByteBuf allocations is provided a method to perform copies using the (pooled) internal NIO buffer Result: Copies from read-only heap ByteBuffer to direct ByteBuf won't create any intermediate ByteBuf	2018-02-27 09:54:21 +09:00
Francesco Nigro	bc8e022601	Added exact utf8 length estimator and exposed writeUtf8 with custom space reservation on destination buffer Motivation: To avoid eager allocation of the destination and to perform length prefixed encoding of UTF-8 string with forward only access pattern Modifications: The original writeUtf8 is modified by allowing customization of the reserved bytes on the destination buffer and is introduced an exact UTF-8 length estimator. Result: Is now possible to perform length first encoding with UTF-8 well-formed char sequences following a forward only write access pattern on the destination buffer.	2018-02-16 11:52:35 +01:00
Scott Mitchell	108fbe5282	ByteBufUtil to not pool direct memory by default Motivation: ByteBufUtil by default will cache DirectByteBuffer objects, and the associated direct memory (up to 64k). In combination with the Recycler which may cache up to 32k elements per thread may lead to a large amount of direct memory being retained per EventLoop thread. As traffic spikes come this may be perceived as a memory leak because the memory in the Recycler will never be reclaimed. Modifications: - By default we shouldn't cache DirectByteBuffer objects. Result: Less direct memory consumption due to caching DirectByteBuffer objects.	2018-02-12 10:49:17 -08:00
Norman Maurer	fbbaf2bd7e	Cleanup buffer tests. Motivation: There is some cleanup that can be done. Modifications: - Use intializer list expression where possible - Remove unused imports. Result: Cleaner code.	2018-02-02 07:33:46 +01:00
Norman Maurer	011841e454	ReadOnlyUnsafeDirectByteBuf.memoryAddress() should not throw Motivation: We need the memoryAddress of a direct buffer when using our native transports. For this reason ReadOnlyUnsafeDirectByteBuf.memoryAddress() should not throw. Modifications: - Correctly override ReadOnlyUnsafeDirectByteBuf.memoryAddress() and hasMemoryAddress() - Add test case Result: Fixes [#7672].	2018-02-02 07:27:26 +01:00
Norman Maurer	95b9b0af5c	Increase timeout and decrement number of operations in AbstractByteBufTest.testToStringMultipleThreads Motivation: We saw some timeouts on the CI when the leak detection is enabled. Modifications: - Use smaller number of operations in test - Increase timeout Result: CI not times out.	2018-01-31 14:57:38 +01:00
Norman Maurer	d2bd36fc4c	ByteBufUtil.isText method should be safe to be called concurrently Motivation: ByteBufUtil.isText(...) may produce unexpected results if called concurrently on the same ByteBuffer. Modifications: - Don't use internalNioBuffer where it is not safe. - Add unit test. Result: ByteBufUtil.isText is thread-safe.	2018-01-31 13:47:49 +01:00
Scott Mitchell	4921f62c8a	HttpResponseStatus object allocation reduction Motivation: Usages of HttpResponseStatus may result in more object allocation then necessary due to not looking for cached objects and the AsciiString parsing method not being used due to CharSequence method being used instead. Modifications: - HttpResponseDecoder should attempt to get the HttpResponseStatus from cache instead of allocating a new object - HttpResponseStatus#parseLine(CharSequence) should check if the type is AsciiString and redirect to the AsciiString parsing method which may not require an additional toString call - HttpResponseStatus#parseLine(AsciiString) can be optimized and doesn't require and may not require object allocation Result: Less allocations when dealing with HttpResponseStatus.	2018-01-24 22:01:52 -08:00
Norman Maurer	336bea9dc5	Fix ByteBuf.nioBuffer(...) and nioBuffers(...) docs to reflect reality. Motivation: Depending on the implementation of ByteBuf nioBuffer(...) and nioBuffers(...) may either share the content or return a ByteBuffer that contains a copy of the content. Modifications: Fix javadocs. Result: Correct docs.	2018-01-22 19:50:08 +01:00
Thomas Devanneaux	3ae57cf302	ByteBuf.toString(Charset) is not thread-safe Motivation: Calling ByteBuf.toString(Charset) on the same buffer from multiple threads at the same time produces unexpected results, such as various exceptions and/or corrupted output. This is because ByteBufUtil.decodeString(...) is taking the source ByteBuffer for CharsetDecoder.decode() from ByteBuf.internalNioBuffer(int, int), which is not thread-safe. Modification: Call ByteBuf.nioBuffer() instead of ByteBuf.internalNioBuffer() to get the source buffer to pass to CharsetDecoder.decode(). Result: Fixes the possible race condition.	2018-01-21 09:02:42 +01:00
Norman Maurer	819b870b8a	Correctly take position into account when wrap a ByteBuffer via ReadOnlyUnsafeDirectByteBuf Motivation: We did not correctly take the position into account when wrapping a ByteBuffer via ReadOnlyUnsafeDirectByteBuf as we obtained the memory address from the original ByteBuffer and not the slice we take. Modifications: - Correctly use the slice to obtain memory address. - Add test case. Result: Fixes [#7565].	2018-01-16 19:18:59 +01:00
Norman Maurer	e329ca1cf3	Introduce ObjectCleaner and use it in FastThreadLocal to ensure FastThreadLocal.onRemoval(...) is called Motivation: There is no guarantee that FastThreadLocal.onRemoval(...) is called if the FastThreadLocal is used by "non" FastThreacLocalThreads. This can lead to all sort of problems, like for example memory leaks as direct memory is not correctly cleaned up etc. Beside this we use ThreadDeathWatcher to check if we need to release buffers back to the pool when thread local caches are collected. In the past ThreadDeathWatcher was used which will need to "wakeup" every second to check if the registered Threads are still alive. If we can ensure FastThreadLocal.onRemoval(...) is called we do not need this anymore. Modifications: - Introduce ObjectCleaner and use it to ensure FastThreadLocal.onRemoval(...) is always called when a Thread is collected. - Deprecate ThreadDeathWatcher - Add unit tests. Result: Consistent way of cleanup FastThreadLocals when a Thread is collected.	2017-12-21 07:34:44 +01:00
Norman Maurer	1988cd041d	Reduce Object allocations in CompositeByteBuf. Motivation: We used subList in CompositeByteBuf to remove ranges of elements from the internal storage. Beside this we also used an foreach loop in a few cases which will crate an Iterator. Modifications: - Use our own sub-class of ArrayList which exposes removeRange(...). This allows to remove a range of elements without an extra allocation. - Use an old style for loop to iterate over the elements to reduce object allocations. Result: Less allocations.	2017-12-12 09:08:58 +01:00
Norman Maurer	09a05b680d	Dont use ThreadDeathWatcher to cleanup PoolThreadCache if FastThreadLocalThread with wrapped Runnable is used Motivation: We dont need to use the ThreadDeathWatcher if we use a FastThreadLocalThread for which we wrap the Runnable and ensure we call FastThreadLocal.removeAll() once the Runnable completes. Modifications: - Dont use a ThreadDeathWatcher if we are sure we will call FastThreadLocal.removeAll() - Add unit test. Result: Less overhead / running theads if you only allocate / deallocate from FastThreadLocalThreads.	2017-11-28 13:43:28 +01:00
Scott Mitchell	a8bb9dc180	AbstractByteBuf readSlice bound check bug Motivation: AbstractByteBuf#readSlice relied upon the bounds checking of the slice operation in order to detect index out of bounds conditions. However the slice bounds checking operation allows for the slice to go beyond the writer index, and this is out of bounds for a read operation. Modifications: - AbstractByteBuf#readSlice and AbstractByteBuf#readRetainedSlice should ensure the desired amount of bytes are readable before taking a slice Result: No reading of undefined data in AbstractByteBuf#readSlice and AbstractByteBuf#readRetainedSlice.	2017-11-18 09:03:42 +01:00
Norman Maurer	cc069722a2	CompositeBytebuf.copy() and copy(...) should respect the allocator Motivation: When calling CompositeBytebuf.copy() and copy(...) we currently use Unpooled to allocate the buffer. This is not really correct and may produce more GC then needed. We should use the allocator that was used when creating the CompositeByteBuf to allocate the new buffer which may be for example the PooledByteBufAllocator. Modifications: - Use alloc() to allocate the new buffer. - Add tests - Fix tests that depend on the copy to be backed by an byte-array without checking hasArray() first. Result: Fixes [#7393].	2017-11-10 07:17:16 -08:00
Idel Pivnitskiy	50a067a8f7	Make methods 'static' where it possible Motivation: Even if it's a super micro-optimization (most JVM could optimize such cases in runtime), in theory (and according to some perf tests) it may help a bit. It also makes a code more clear and allows you to access such methods in the test scope directly, without instance of the class. Modifications: Add 'static' modifier for all methods, where it possible. Mostly in test scope. Result: Cleaner code with proper 'static' modifiers.	2017-10-21 14:59:26 +02:00
Nikolay Fedorovskikh	17e1a26d64	Fixes a javadoc for ByteBufUtil#copy method Motivation: Javadoc of the `ByteBufUtil#copy(AsciiString, int, ByteBuf, int, int)` is incorrect. Modifications: Fix it. Result: The description of the `#copy` method is not misleading.	2017-10-21 14:51:20 +02:00
Nikolay Fedorovskikh	09dd6a5d4d	Minor improvements in ByteBufOutputStream Motivation: In the `ByteBufOutputStream` we can use an appropriate methods of `ByteBuf` to reduce calls of virtual methods and do not copying converting logic. Modifications: - Use an appropriate methods of `ByteBuf` - Remove redundant conversions (int -> byte, int -> char). - Use `ByteBuf#writeCharSequence` in the `writeBytes(String)'. Result: Less code duplication. A `writeBytes(String)` method is faster. No unnecessary conversions. More consistent and cleaner code.	2017-10-21 14:44:13 +02:00
Carl Mastrangelo	16b1dbdf92	Motivation: Resource Leak Detector (RLD) tries to helpfully indicate where an object was last accessed and report the accesses in the case the object was not cleaned up. It handles lightly used objects well, but drops all but the last few accesses. Configuring this is tough because there is split between highly shared (and accessed) objects and lightly accessed objects. Modification: There are a number of changes here. In relative order of importance: API / Functionality changes: * Max records and max sample records are gone. Only "target" records, the number of records tries to retain is exposed. * Records are sampled based on the number of already stored records. The likelihood of recording a new sample is `2^(-n)`, where `n` is the number of currently stored elements. * Records are stored in a concurrent stack structure rather than a list. This avoids a head and tail. Since the stack is only read once, there is no need to maintain head and tail pointers * The properties of this imply that the very first and very last access are always recorded. When deciding to sample, the top element is replaced rather than pushed. * Samples that happen between the first and last accesses now have a chance of being recorded. Previously only the final few were kept. * Sampling is no longer deterministic. Previously, a deterministic access pattern meant that you could conceivably always miss some access points. * Sampling has a linear ramp for low values and and exponentially backs off roughly equal to 2^n. This means that for 1,000,000 accesses, about 20 will actually be kept. I have an elegant proof for this which is too large to fit in this commit message. Code changes: * All locks are gone. Because sampling rarely needs to do a write, there is almost 0 contention. The dropped records counter is slightly contentious, but this could be removed or changed to a LongAdder. This was not done because of memory concerns. * Stack trace exclusion is done outside of RLD. Classes can opt to remove some of their methods. * Stack trace exclusion is faster, since it uses String.equals, often getting a pointer compare due to interning. Previously it used contains() * Leak printing is outputted fairly differently. I tried to preserve as much of the original formatting as possible, but some things didn't make sense to keep. Result: More useful leak reporting. Faster: ``` Before: Benchmark (recordTimes) Mode Cnt Score Error Units ResourceLeakDetectorRecordBenchmark.record 8 thrpt 20 136293.404 ± 7669.454 ops/s ResourceLeakDetectorRecordBenchmark.record 16 thrpt 20 72805.720 ± 3710.864 ops/s ResourceLeakDetectorRecordBenchmark.recordWithHint 8 thrpt 20 139131.215 ± 4882.751 ops/s ResourceLeakDetectorRecordBenchmark.recordWithHint 16 thrpt 20 74146.313 ± 4999.246 ops/s After: Benchmark (recordTimes) Mode Cnt Score Error Units ResourceLeakDetectorRecordBenchmark.record 8 thrpt 20 155281.969 ± 5301.399 ops/s ResourceLeakDetectorRecordBenchmark.record 16 thrpt 20 77866.239 ± 3821.054 ops/s ResourceLeakDetectorRecordBenchmark.recordWithHint 8 thrpt 20 153360.036 ± 8611.353 ops/s ResourceLeakDetectorRecordBenchmark.recordWithHint 16 thrpt 20 78670.804 ± 2399.149 ops/s ```	2017-10-19 12:21:21 -07:00
Carl Mastrangelo	83a19d5650	Optimistically update ref counts Motivation: Highly retained and released objects have contention on their ref count. Currently, the ref count is updated using compareAndSet with care to make sure the count doesn't overflow, double free, or revive the object. Profiling has shown that a non trivial (~1%) of CPU time on gRPC latency benchmarks is from the ref count updating. Modification: Rather than pessimistically assuming the ref count will be invalid, optimistically update it assuming it will be. If the update was wrong, then use the slow path to revert the change and throw an execption. Most of the time, the ref counts are correct. This changes from using compareAndSet to getAndAdd, which emits a different CPU instruction on x86 (CMPXCHG to XADD). Because the CPU knows it will modifiy the memory, it can avoid contention. On a highly contended machine, this can be about 2x faster. There is a downside to the new approach. The ref counters can temporarily enter invalid states if over retained or over released. The code does handle these overflow and underflow scenarios, but it is possible that another concurrent access may push the failure to a different location. For example: Time 1 Thread 1: obj.retain(INT_MAX - 1) Time 2 Thread 1: obj.retain(2) Time 2 Thread 2: obj.retain(1) Previously Thread 2 would always succeed and Thread 1 would always fail on the second access. Now, thread 2 could fail while thread 1 is rolling back its change. ==== There are a few reasons why I think this is okay: 1. Buggy code is going to have bugs. An exception _is_ going to be thrown. This just causes the other threads to notice the state is messed up and stop early. 2. If high retention counts are a use case, then ref count should be a long rather than an int. 3. The critical section is greatly reduced compared to the previous version, so the likelihood of this happening is lower 4. On error, the code always rollsback the change atomically, so there is no possibility of corruption. Result: Faster refcounting ``` BEFORE: Benchmark (delay) Mode Cnt Score Error Units AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 1 sample 2901361 804.579 ± 1.835 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 10 sample 3038729 785.376 ± 16.471 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 100 sample 2899401 817.392 ± 6.668 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 1000 sample 3650566 2077.700 ± 0.600 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 10000 sample 3005467 19949.334 ± 4.243 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 1 sample 456091 48.610 ± 1.162 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 10 sample 732051 62.599 ± 0.815 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 100 sample 778925 228.629 ± 1.205 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 1000 sample 633682 2002.987 ± 2.856 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 10000 sample 506442 19735.345 ± 12.312 ns/op AFTER: Benchmark (delay) Mode Cnt Score Error Units AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 1 sample 3761980 383.436 ± 1.315 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 10 sample 3667304 474.429 ± 1.101 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 100 sample 3039374 479.267 ± 0.435 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 1000 sample 3709210 2044.603 ± 0.989 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_contended 10000 sample 3011591 19904.227 ± 18.025 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 1 sample 494975 52.269 ± 8.345 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 10 sample 771094 62.290 ± 0.795 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 100 sample 763230 235.044 ± 1.552 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 1000 sample 634037 2006.578 ± 3.574 ns/op AbstractReferenceCountedByteBufBenchmark.retainRelease_uncontended 10000 sample 506284 19742.605 ± 13.729 ns/op ```	2017-10-04 08:42:33 +02:00
回眸,境界	06da0ceb64	Fix typo in comment.	2017-10-02 08:16:22 +02:00
Carl Mastrangelo	003c8cc7ab	Expose all defaults on PooledByteBufAllocator Motivation: Most, but not all defaults are statically exposed on PooledByteBufAllocator. This makes it cumbersome to make a custom allocator where most of the defaults remain the same. Modification: Expose useCacheForAllThreads, and Direct preferred. The latter is needed because it is under the internal package, and public code should probably not depend on it. Result: More customizeable allocators	2017-09-20 21:48:53 -07:00
Norman Maurer	9d56439aa1	Make UnpooledDirectByteBuf, UnpooledHeapByteBuf and UnpooledUnsafeDirectByteBuf constructors public. Motivation: The constrcutors a protected atm but the classes are public. We should make the constructors public as well to make it easier to write your own ByteBufAllocator. Modifications: Change constructors to be public and add some javadocs. Result: Easier to create own ByteBufAllocator.	2017-09-18 21:42:46 -07:00
Carl Mastrangelo	d2cb51bc2e	Enable PooledByteBufAllocator to work, event without a cache Motivation: `useCacheForAllThreads` may be false which disables memory caching on non netty threads. Setting this argument or the system property makes it impossible to use `PooledByteBufAllocator`. Modifications: Delayed the check of `freeSweepAllocationThreshold` in `PoolThreadCache` to after it knows there will be any caches in use. Additionally, check if the caches will have any data in them (rather than allocating a 0-length array). A test case is also added that fails without this change. Results: Fixes #7194	2017-09-08 10:20:45 +02:00
Norman Maurer	bca35b0449	Allow to construct UnpooledByteBufAllocator that explictly always use sun.misc.Cleaner Motivation: When the user want to have the direct memory explicitly managed by the GC (just as java.nio does) it is useful to be able to construct an UnpooledByteBufAllocator that allows this without the chances to see any memory leak. Modifications: Allow to explicitly disable the usage of reflection to construct direct ByteBufs and so be sure these will be collected by GC. Result: More flexible way to use the UnpooledByteBufAllocator.	2017-08-31 12:57:09 +02:00
Carl Mastrangelo	7528e5a11e	Use threadsafe setter on Atomic Updaters Motivation: The documentation for field updates says: > Note that the guarantees of the {@code compareAndSet} > method in this class are weaker than in other atomic classes. > Because this class cannot ensure that all uses of the field > are appropriate for purposes of atomic access, it can > guarantee atomicity only with respect to other invocations of > {@code compareAndSet} and {@code set} on the same updater. This implies that volatiles shouldn't use normal assignment; the updater should set them. Modifications: Use setter for field updaters that make use of compareAndSet. Result: Concurrency compliant code	2017-08-31 10:14:40 +02:00
Norman Maurer	e487db7836	Use the ByteBufAllocator when copy a ReadOnlyByteBufferBuf and so also be able to release it without the GC when the Cleaner is present. Motivation: In ReadOnlyByteBufferBuf.copy(...) we just allocated a ByteBuffer directly and wrapped it. This way it was not possible for us to free the direct memory that was used by the copy without the GC. Modifications: - Ensure we use the allocator when create the copy and so be able to release direct memory in a timely manner - Add unit test - Depending on if the to be copied buffer is direct or heap based we also allocate the same type on copy. Result: Fixes [#7103].	2017-08-16 07:33:10 +02:00
Nikolay Fedorovskikh	0774c91456	Support the little endian floats and doubles by ByteBuf Motivation: `ByteBuf` does not have the little endian variant of float/double access methods. Modifications: Add support for little endian floats and doubles into `ByteBuf`. Result: `ByteBuf` has get/read/set/writeFloatLE() and get/read/set/writeDoubleLE() methods. Fixes [#6576].	2017-08-15 06:24:28 +02:00
louyl	8be9a63c1c	FIX endless loop in ByteBufUtil#writeAscii Motivation: Missing return in ByteBufUtil#writeAscii causes endless loop Modifications: Add return after write finished Result: ByteBufUtil#writeAscii is ok	2017-08-12 12:50:00 +02:00
Scott Mitchell	452fd36240	ByteBufs which are not resizable should not throw in ensureWritable(int,boolean) Motivation: ByteBuf#ensureWritable(int,boolean) returns an int indicating the status of the resize operation. For buffers that are unmodifiable or cannot be resized this method shouldn't throw but just return 1. ByteBuf#ensureWriteable(int) should throw unmodifiable buffers. Modifications: - ReadOnlyByteBuf should be updated as described above. - Add a unit test to SslHandler which verifies the read only buffer can be tolerated in the aggregation algorithm. Result: Fixes https://github.com/netty/netty/issues/7002.	2017-07-22 08:44:48 -07:00
Norman Maurer	06f64948d5	Add tests to ensure an IllegalReferenceCountException is thrown if set/writeCharSequence is called on a released buffer Motivation: We need to ensure we not allow calling set/writeCharsequence on an released ByteBuf. Modifications: Add test-cases Result: Proves fix of [#6951].	2017-07-21 07:39:32 +02:00
Norman Maurer	4af47f0ced	AbstractByteBuf.setCharSequence(...) must not expand buffer Motivation: AbstractByteBuf.setCharSequence(...) must not expand the buffer if not enough writable space is present in the buffer to be consistent with all the other set operations. Modifications: - Ensure we only exand the buffer on writeCharSequence(...) but not on setCharSequence(...) - Add unit tests. Result: Consistent and correct behavior.	2017-07-19 19:44:53 +02:00
Norman Maurer	d125adec38	AbstractByteBuf.ensureWritable(...) should check if buffer was released Motivation: AbstractByteBuf.ensureWritable(...) should check if buffer was released and if so throw an IllegalReferenceCountException Modifications: Ensure we throw in all cases. Result: More consistent and correct behaviour	2017-07-19 07:34:08 +02:00
louxiu	0ad99310f5	Record release when enable detailed leak detection Motivation: It would be easier to find where is missing release call in several retain release calls on a ByteBuf Modifications: Remove final modifier on SimpleLeakAwareByteBuf and SimpleLeakAwareByteBuf release function and override it to record release in AdvancedLeakAwareByteBuf and AdvancedLeakAwareCompositeByteBuf Result: Release will be recorded when enable detailed leak detection	2017-07-18 09:28:56 +02:00
Scott Mitchell	86e653e04f	SslHandler aggregation of plaintext data on write Motivation: Each call to SSL_write may introduce about ~100 bytes of overhead. The OpenSslEngine (based upon OpenSSL) is not able to do gathering writes so this means each wrap operation will incur the ~100 byte overhead. This commit attempts to increase goodput by aggregating the plaintext in chunks of <a href="https://tools.ietf.org/html/rfc5246#section-6.2">2^14</a>. If many small chunks are written this can increase goodput, decrease the amount of calls to SSL_write, and decrease overall encryption operations. Modifications: - Introduce SslHandlerCoalescingBufferQueue in SslHandler which will aggregate up to 2^14 chunks of plaintext by default - Introduce SslHandler#setWrapDataSize to control how much data should be aggregated for each write. Aggregation can be disabled by setting this value to <= 0. Result: Better goodput when using SslHandler and the OpenSslEngine.	2017-07-10 12:22:08 -07:00
Nikolay Fedorovskikh	df568c739e	Use ByteBuf#writeShort/writeMedium instead of writeBytes Motivation: 1. Some encoders used a `ByteBuf#writeBytes` to write short constant byte array (2-3 bytes). This can be replaced with more faster `ByteBuf#writeShort` or `ByteBuf#writeMedium` which do not access the memory. 2. Two chained calls of the `ByteBuf#setByte` with constants can be replaced with one `ByteBuf#setShort` to reduce index checks. 3. The signature of method `HttpHeadersEncoder#encoderHeader` has an unnecessary `throws`. Modifications: 1. Use `ByteBuf#writeShort` or `ByteBuf#writeMedium` instead of `ByteBuf#writeBytes` for the constants. 2. Use `ByteBuf#setShort` instead of chained call of the `ByteBuf#setByte` with constants. 3. Remove an unnecessary `throws` from `HttpHeadersEncoder#encoderHeader`. Result: A bit faster writes constants into buffers.	2017-07-10 14:37:41 +02:00
Norman Maurer	83db2b07b4	Also use realloc when shrink the buffer. Motivation: We should also use realloc when shrink the buffer to eliminate extra allocations / memory copies when possible. Modifications: Use realloc for expanding and shrinking when possible. Result: Less memory copies and allocations	2017-07-06 20:03:15 +02:00
Nikolay Fedorovskikh	f35047765f	Avoid a double check ByteBuf#ensureWritable in ByteBufUtil Motivation: Methods `ByteBufUtil#writeUtf8` and `ByteBufUtil#writeAscii` contains a check `ByteBuf#ensureWritable` before the calling `ByteBuf#writeBytes`. But the `ByteBuf#writeBytes` also do a such check inside. Modifications: Make checks more targeted. Result: Less redundant method calls.	2017-06-28 19:00:01 +02:00
Nikolay Fedorovskikh	ba3616da3e	Apply appropriate methods for writing CharSequence into ByteBuf Motivation: 1. `ByteBuf` contains methods to writing `CharSequence` which optimized for UTF-8 and ASCII encodings. We can also apply optimization for ISO-8859-1. 2. In many places appropriate methods are not used. Modifications: 1. Apply optimization for ISO-8859-1 encoding in the `ByteBuf#setCharSequence` realizations. 2. Apply appropriate methods for writing `CharSequences` into buffers. Result: Reduce overhead from string-to-bytes conversion.	2017-06-27 07:58:39 +02:00
Nikolay Fedorovskikh	01eb428b39	Move methods for decode hex dump into StringUtil Motivation: PR #6811 introduced a public utility methods to decode hex dump and its parts, but they are not visible from netty-common. Modifications: 1. Move the `decodeHexByte`, `decodeHexDump` and `decodeHexNibble` methods into `StringUtils`. 2. Apply these methods where applicable. 3. Remove similar methods from other locations (e.g. `HpackHex` test class). Result: Less code duplication.	2017-06-23 18:52:42 +02:00
Norman Maurer	7922757575	Allow to access memoryAddress of wrapped ByteBuf for ReadOnlyByteBuf Motivation: We should allow to access the memoryAddress of the wrapped ByteBuf when using ReadOnlyByteBuf for peformance reasons. If a user act on a memoryAddress its his responsible anyway to do nothing "stupid". Modifications: Delegate to wrapped ByteBuf. Result: Less performance overhead for various operations and also when writing to a native transport (which needs the memoryAddress).	2017-06-07 18:45:26 +02:00
Renjie Sun	629b83e0a5	Move QueryStringDecoder.decodeHexByte into ByteBufUtil Motivations: 1. There are duplicated implementations of decoding hex strings. #6797 2. ByteBufUtil.HexUtil.decodeHexDump does not handle substring start index properly and does not decode hex byte rigorously. Modifications: 1. Function decodeHexByte is moved from QueryStringDecoder into ByteBufUtil. 2. ByteBufUtil.HexUtil.decodeHexDump is changed to use decodeHexByte. 3. Tests are Updated accordingly. Result: Fixed #6797 and made hex decoding functions more robust.	2017-06-07 09:27:36 -07:00
Scott Mitchell	b71abcedd1	ByteBufUtil#decodeHexDump Motivation: ByteBufUtil provides a hexDump method. For debugging purposes it is often useful to decode that hex dump to get the original content, but no such method exists. Modifications: - Add ByteBufUtil#decodeHexDump Result: ByteBufUtil#decodeHexDump is available to make debugging easier.	2017-05-30 15:20:54 -07:00
Scott Mitchell	63f5cdb0d5	ByteBuf#ensureWritable(int, boolean) should not throw Motivation: The javadocs for ByteBuf#ensureWritable(int, boolean) indicate that it should not throw, and instead the return code should indicate the result of the operation. Due to a bug in AbstractByteBuf it is possible for a resize to be attempted on a buffer that may exceed maxCapacity() and therefore throw. Modifications: - If there is not enough space in the buffer, and force is false, then a resize should not be attempted Result: AbstractByteBuf#ensureWritable(int, boolean) enforces the javadoc constraints and does not throw.	2017-05-09 00:12:25 -07:00
Norman Maurer	b662afeece	Correctly release all buffers in UnpooledTest Motivation: We not correctly released all buffers in the UnpooledTest and so showed "bad" way of handling buffers to people that inspect our code to understand when a buffer needs to be released. Modifications: Explicit release all buffers. Result: Cleaner and more correct code.	2017-04-27 19:29:45 +02:00
Jason Tedor	98beb777f8	Enable configuring available processors Motivation: In cases when an application is running in a container or is otherwise constrained to the number of processors that it is using, the JVM invocation Runtime#availableProcessors will not return the constrained value but rather the number of processors available to the virtual machine. Netty uses this number in sizing various resources. Additionally, some applications will constrain the number of threads that they are using independenly of the number of processors available on the system. Thus, applications should have a way to globally configure the number of processors. Modifications: Rather than invoking Runtime#availableProcessors, Netty should rely on a method that enables configuration when the JVM is started or by the application. This commit exposes a new class NettyRuntime for enabling such configuraiton. This value can only be set once. Its default value is Runtime#availableProcessors so that there is no visible change to existing applications, but enables configuring either a system property or configuring during application startup (e.g., based on settings used to configure the application). Additionally, we introduce the usage of forbidden-apis to prevent future uses of Runtime#availableProcessors from creeping. Future work should enable the bundled signatures and clean up uses of deprecated and other forbidden methods. Result: Netty can be configured to not use the underlying number of processors, but rather the constrained number of processors.	2017-04-23 10:31:17 +02:00
Norman Maurer	bf0beb772c	Fix IllegalArgumentException when release a wrapped ByteBuffer on Java9 Motivation: Unsafe.invokeCleaner(...) checks if the passed in ByteBuffer is a slice or duplicate and if so throws an IllegalArgumentException on Java9. We need to ensure we never try to free a ByteBuffer that was provided by the user directly as we not know if its a slice / duplicate or not. Modifications: Never try to free a ByteBuffer that was passed into UnpooledUnsafeDirectByteBuf constructor by an user (via Unpooled.wrappedBuffer(....)). Result: Build passes again on Java9	2017-04-20 19:19:11 +02:00
Nikolay Fedorovskikh	0692bf1b6a	fix the typos	2017-04-20 04:56:09 +02:00
Norman Maurer	e482d933f7	Add 'io.netty.tryAllocateUninitializedArray' system property which allows to allocate byte[] without memset in Java9+ Motivation: Java9 added a new method to Unsafe which allows to allocate a byte[] without memset it. This can have a massive impact in allocation times when the byte[] is big. This change allows to enable this when using Java9 with the io.netty.tryAllocateUninitializedArray property when running Java9+. Please note that you will need to open up the jdk.internal.misc package via '--add-opens java.base/jdk.internal.misc=ALL-UNNAMED' as well. Modifications: Allow to allocate byte[] without memset on Java9+ Result: Better performance when allocate big heap buffers and using java9.	2017-04-19 11:45:39 +02:00
Scott Mitchell	21562d8808	Retained[Duplicate\|Slice] operations should not increase the reference count for UnreleasableByteBuf Motivation: UnreleasableByteBuf operations are designed to not modify the reference count of the underlying buffer. The Retained[Duplicate\|Slice] operations violate this assumption and can cause the underlying buffer's reference count to be increased, but never allow for it to be decreased. This may lead to memory leaks. Modifications: - UnreleasableByteBuf's Retained[Duplicate\|Slice] should leave the reference count of the parent buffer unchanged after the operation completes. Result: No more memory leaks due to usage of the Retained[Duplicate\|Slice] on an UnreleasableByteBuf object.	2017-03-31 17:45:29 -07:00
Scott Mitchell	ef21d5f4ca	UnsafeByteBufUtil errors and simplification Motiviation: UnsafeByteBufUtil has some bugs related to using an incorrect index, and also omitting the array paramter when dealing with byte[] objects. There is also some simplification possible with respect to type casting, and minor formatting consistentcy issues. Modifications: - Ensure indexing is correct when dealing with native memory - Fix the native access and endianness for the medium/unsigned medium methods - Ensure array is used when dealing with heap memory - Remove unecessary casts when using long - Fix formating and alignment Result: UnsafeByteBufUtil is more correct and won't access direct memory when heap arrays are used.	2017-03-30 11:52:03 -07:00
Norman Maurer	6036b3f6ea	Fix buffer leak in EmptyByteBufTest introduced by `aa2f16f314`	2017-03-27 05:20:02 +02:00
Bryce Anderson	aa2f16f314	EmptyByteBuf allows writing ByteBufs with 0 readable bytes Motivation: The contract of `ByteBuf.writeBytes(ByteBuf src)` is such that it will throw an `IndexOutOfBoundsException if `src.readableBytes()` is greater than `this.writableBytes()`. The EmptyByteBuf class will throw the exception, even if the source buffer has zero readable bytes, in violation of the contract. Modifications: Use the helper method `checkLength(..)` to check the length and throw the exception, if appropriate. Result: Conformance with the stated behavior of ByteBuf.	2017-03-21 22:00:54 -07:00
Nikolay Fedorovskikh	2993760e92	Fix misordered 'assertEquals' arguments in tests Motivation: Wrong argument order in some 'assertEquals' applying. Modifications: Flip compared arguments. Result: Correct `assertEquals` usage.	2017-03-08 22:48:37 -08:00
Norman Maurer	3ad3356892	Expose ByteBufAllocator metric in a more general way Motivation: PR [#6460] added a way to access the used memory of an allocator. The used naming was not very good and how things were exposed are not consistent. Modifications: - Add a new ByteBufAllocatorMetric and ByteBufAllocatorMetricProvider interface - Let the ByteBufAllocator implementations implement ByteBufAllocatorMetricProvider - Move exposed stats / metric from PooledByteBufAllocator to PooledByteBufAllocatorMetric and mark old methods as `@Deprecated`. Result: More consistent way to expose metric / stats for ByteBufAllocator	2017-03-08 20:07:58 +01:00
Scott Mitchell	2cff918044	Correct usages of internalNioBuffer Motivation: There are numerous usages of internalNioBuffer which hard code 0 for the index when the intention was to use the readerIndex(). Modifications: - Remove hard coded 0 for the index and use readerIndex() Result: We are less susceptible to using the wrong index, and don't make assumptions about the ByteBufAllocator.	2017-03-02 12:51:22 -08:00
Norman Maurer	461f9a1212	Allow to obtain informations of used direct and heap memory for ByteBufAllocator implementations Motivation: Often its useful for the user to be able to get some stats about the memory allocated via an allocator. Modifications: - Allow to obtain the used heap and direct memory for an allocator - Add test case Result: Fixes [#6341]	2017-03-01 18:53:43 +01:00
Norman Maurer	a7fe6c0153	Metrics exposed by PooledByteBufAllocator needs to be correctly synchronized Motivation: As we may access the metrics exposed of PooledByteBufAllocator from another thread then the allocations happen we need to ensure we synchronize on the PoolArena to ensure correct visibility. Modifications: Synchronize on the PoolArena to ensure correct visibility. Result: Fix multi-thread issues on the metrics	2017-03-01 06:26:08 +01:00
Norman Maurer	deb90923a2	Ensure PooledByteBuf.initUnpooled(...) correctly set the allocator Motivation: Commit `8dda984afe` introduced a regression which lead to the situation that the allocator is not set when PooledByteBuf.initUnpooled(...) is called. Thus it was possible that PooledByteBuf.alloc() returns null or the wrong allocator if multiple PooledByteBufAllocator are used in an application. Modifications: - Correctly set the allocator - Add test-case Result: Fixes [#6436].	2017-02-23 19:53:17 +01:00
Nikolay Fedorovskikh	0623c6c533	Fix javadoc issues Motivation: Invalid javadoc in project Modifications: Fix it Result: More correct javadoc	2017-02-22 07:31:07 +01:00
Norman Maurer	fbf0e5f4dd	Prefer JDK ThreadLocalRandom implementation over ours. Motivation: We have our own ThreadLocalRandom implementation to support older JDKs . That said we should prefer the JDK provided when running on JDK >= 7 Modification: Using ThreadLocalRandom implementation of the JDK when possible. Result: Make use of JDK implementations when possible.	2017-02-16 15:44:00 -08:00

1 2 3 4 5 ...

678 Commits