xserver-multidpi

Author	SHA1	Message	Date
Brian Paul	644e05562e	Remove useless return statement Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Brian Paul	8c51eb8239	Remove redundant dispatch->glEnable(GL_TEXTURE_2D) The same call was already made a few lines earlier. Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Brian Paul	3bf1eb577e	Fix _glamor_set_spans() bug (re-used 'n' variable) n was used as a function parameter. But inside the for (i=1..n) loop, n got reassigned as REGION_NUM_RECTS() and then decremented to zero by the while loop. This caused the for loop to only iterate once instead of 'n' times. This patch renames the n parameter to numPoints. Found by code inspection. Untested. Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Grigori Goronzy	2f62bd46cc	glamor_render: fix PictFilters Add Fast/Good/Best and appropriately map to Nearest and Bilinear. Additionally, add a fallback path for unsupported filters. Notably, this fixes window shadow rendering with Compiz, which uses PictFilterConvolution for some odd reason. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Grigori Goronzy	5695708ecd	Use GL_STATIC_DRAW for element index buffer The buffer never changes anyway. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Grigori Goronzy	8afa008ec4	Use glDrawRangeElements instead of glDrawElements This lets us explicitly specify the range of vertices that are used, which the OpenGL driver can use for optimization. Particularly, it results in lower CPU overhead with Mesa-based drivers. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Zhigang Gong	229601e080	Shoud return null subpixmap if we fail to get a valid map address. The patch is prepared by Raul Fernandes. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=86693 Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Dave Airlie	e3d1d4e3ca	glamor: add initial Xv support This does YV12 and I420 for now, not sure if we can do packed without a GL extension. Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2013-12-18 11:23:54 -08:00
Michel Dänzer	39e95cd0f5	Reset traps_count and ptrap when necessary for the next trapezoid cliprect Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=64912 Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: He Junyan <junyan.he@inbox.com> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Michel Dänzer	61fca4342a	Fix RegionContainsRect test for PutImage The return value of RegionContainsRect() is not a boolean but an enum indicating that the region contains the rectangle completely, partially or not at all. We can only take the PutImage fastpath when the region contatins the rectangle completely. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=65964 Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:54 -08:00
Christian König	2ea618f2cf	Use GBM_LIBS and GBM_CFLAGS Signed-off-by: Christian König <christian.koenig at amd.com> Reviewed-by: Michel Dänzer <michel.daenzer at amd.com>	2013-12-18 11:23:54 -08:00
Armin K	7e818f7d39	First attempt to make libglamor.so shared versioned library As recommended by Michel in this thread reply: http://lists.freedesktop.org/archives/glamor/2013-March/000305.html v2: Correct shared library location in glamor.pc.in Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=62259 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-12-18 11:23:53 -08:00
Armin K	b0318b6de7	Properly dist necesary headers Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-12-18 11:23:53 -08:00
Armin K	fc179bb863	Silence Automake 1.13 warnings warning: 'INCLUDES' is the old name for 'AM_CPPFLAGS' (or '*_CPPFLAGS') Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2013-12-18 11:23:53 -08:00
Michel Dänzer	97416e3f14	glamoregl: Use xf86ScreenToScrn() Fixes crashes when glamor is used for a GPU screen with xserver 1.13 or newer. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=57200#c17 Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2013-12-18 11:23:53 -08:00
Dave Airlie	4c5bfd529f	glamor_utils: fix unlikely define use using a define across a split line expression is failure, compiling with warnings shows this up. Signed-off-by: Dave Airlie <airlied@redhat.com>	2013-12-18 11:23:53 -08:00
Dave Airlie	6b954880c2	glamor: add compiler.h This is also required for distchecking. Signed-off-by: Dave Airlie <airlied@redhat.com>	2013-12-18 11:23:53 -08:00
Dave Airlie	efbdc9e90f	glamor: fix make distcheck part 1 This just adds the headers, then it falls over on the sdk_HEADERS as it overrides proper install paths by the looks of it. Signed-off-by: Dave Airlie <airlied@redhat.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	80f5e21dae	glamor_compositerects: Need to initialize region before fallback. As we need to call DamageRegionAppend even for fallback path, we must initialize the region before do that. Pointed by Igor Vagulin. https://bugs.freedesktop.org/show_bug.cgi?id=56940 Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Michel Dänzer	14e02f5132	Don't use glBlitFramebufferEXT for overlapping copies. According to the GL_EXT_framebuffer_blit spec, the result of doing so is undefined. But we need well-defined results. :) Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	e846f48f48	Increase vbo size to 64K verts. This commit will benefit vertex stressing cases such as aa10text/rgb10text, and can get about 15% performance gain. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com> Acked-by: Junyan <junyan.he@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	b8f0a21882	Silence compilation warnings. After increase to gcc4.7, it reports more warnings, now fix them. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com> Tested-by: Junyan He<junyan.he@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	50614451ad	glamor_largepixmap: Fixed a bug in repeat clipping. If the repeat direction only has one block, then we need to set the dx/dy to cover all the extent. This commit also silence some compilation warnings. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Michel Dänzer	7eb434918b	Prefer KHR_surfaceless_context EGL extension over KHR_surfaceless_opengl/gles2. Current Mesa Git only advertises the former instead of the latter. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Michel Dänzer	59653fa08a	Print space between name of missing EGL extension and 'required'. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Junyan He	c3096c5a56	Fallback to pixman when trapezoid mask is big. The trapezoid generating speed of the shader is relatively slower when the trapezoid area is big. We fallback when the trapezoid's width and height is bigger enough. The big traps number will also slow down the render because of the VBO size. We fallback if ntrap > 256 Signed-off-by: Junyan He <junyan.he@linux.intel.com> Reviewed-By: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	f62f4d53ef	glamor_glyphs: When dst arg point to a NULL buffer, dont't flush. This is a corner case, when we render glyphs via mask cache, and when we need to upload new glyphs cache, we need to flush both the mask and dest buffer. But we the dest arg may point to a NULL buffer at that time, we need to check it firstly. If the dest buffer is NULL. Then we don't need to flush both the dest and mask buffer. This commit fix a potential crash. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	e7af7cb76d	glamor_trapezoid: workaround a glsl like problem. It seems that the following statement cann't run as expected on SNB. bool trap_left_vertical = (abs(trap_left_vertical_f - 1.0) <= 0.0001); Have to rewrite it to another style to let the vertical edge trapezoid to be rendered correctly. Reviewed-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Junyan He	5512c14e34	Fix the problem of VBO leak. In some cases we allocate the VBO but have no vertex to emit, which cause the VBO fail to be released. Fix it. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:53 -08:00
Junyan He	9f78e22fa6	Just use the shader to generate trapezoid if PolyMode == Imprecise The precise mode of trapezoid rendering need to sample the trapezoid on the centre points of an (2n+1)x(2n-1) subpixel grid. It is computationally expensive in shader, and we use inside area ratio to replace it. The result has some difference, and we just use it if the polymode == Imprecise. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:53 -08:00
Junyan He	fe024a7822	Change the trapezoid render to use VBO. Because some uniform variables need to be set for every trapezoid rendering, we can not use vbo to render multi trapezoids one time, which have performance big loss. We now add attributes which contain the same value to bypass the uniform variable problem. The uniform value for one trapezoid will be set to the same value to all the vertex of that trapezoid as an attribute, then in FS, it is still a constant. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	9dff3378e5	Added the missed header file for xorg 1.13 compat. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	bc1b412b3b	Synch with xorg 1.13 change. As xorg 1.13 change the scrn interaces and remove those global arrays. Some API change cause we can't build. Now fix it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	4c27ca4700	gles2: Fixed the compilation problem and some bugs. Previous patch doesn't set the offset to zero for GLESv2 path. Now fix it. This patch also fix a minor problem in pixmap uploading preparation. If the revert is not REVERT_NORMAL, then we don't need to prepare a fbo for it. As current mesa i965 gles2 driver doesn't support to set a A8 texture as a fbo target, we must fix this problem. As some A1/A8 picture need to be uploaded, this is the only place a A8 texture may be attached to a fbo. This patch also enable the shader gradient for GLESv2. The reason we disable it before is that some glsl linker doesn't support link different objects which have cross reference. Now we don't have that problem. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Michel Dänzer	006fe0e66d	Stream vertex data to VBOs. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Michel D=C3=A4nzer	551ca11c77	Fix translation of clip region for composite fallback. Fixes incorrectly clipped rendering. E.g. the cursor in Evolution composer windows became invisible. Signed-off-by: Michel Daenzer <michel.daenzer@amd.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	88c317fb1e	glamor_glyphs: Don't merge extents for different lists. If we merge all lists's extent together, than we may have some fail overlap checking. Here is a simple: A E B F C D The first list has vertical "ABCD". And the second list has two char "EF". When detecting E, it can successfully find it doesn't overlap with previous glyphs. But after that, the original code will merge the previous extent with E's extent, then the extent will cover "F", so when detecting F, it will be treated as overlapped. We can simply solve this issue by not merge extent from different list. We can union different list's extent to a global region. And then do the intersect checkint between that region and current glyph extent, then we can avoid that fail checking. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	32a7438bf7	glamor_copyarea: Use blitcopy if current state is not render. Practically, for pure 2D blit, the blit copy is much faster than textured copy. For the x11perf copywinwin100, it's about 3x faster. But if we have heavy rendering/compositing, then use textured copy will get much better (>30%)performance for most of the cases. So we simply add a data element to track current state. For rendering state we use textured copy, otherwise, we use blit copy. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:53 -08:00
Zhigang Gong	0706423bcf	glamor_glyphs: Use cache picture to store mask picture if possible. By default, mask picture is newly created, and each time we need to clear the whole mask picture, and then composite glyphs to the mask picture and then composite the mask picture to destination. Testing results shows that the filling of the mask picture takes a big portion of the rendering time. As we don't really need to clear the whole region, we just need to clear the real overlapped region. This commit is to solve this issue. We split a large glyphs list to serval lists and each list is non-overlapped or overlapped. we can reduce the length of overlapped glyphs to do the glyphs_via_mask to 2 or 3 glyphs one time for most cases. Thus it give us a case to allocate a small portion of the corresponding cache directly as the mask picture. Then we can rendering the glyphs to this mask picture, and latter we can accumulate the second steps, composite the mask to the dest with the other non-overlapped glyphs's rendering process. It also make us implement a batch mask cache blocks clearing algorithm to avoid too frequently small region clearing. If there is no any overlapping, this method will not get performance gain. If there is some overlapping, then this algorithm can get about 15% performance gain. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	4d1a2173f2	glamor_compositerects: Implement optimized version. Don't call miCompositeRects. Use glamor_composite_clipped_region to render those boxes at once. Also add a new function glamor_solid_boxes to fill boxes at once. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	dd79243398	optimize: Use likely and unlikely. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	d5f03ba010	create_pixmap: use texture for large glyphs. As we only cache glyphs smaller than 64x64, we need to use texutre for the large glyphs. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	90dd6ddbab	glamor_copyarea: Fixed a bug introduced by 996194... Default return value should be FALSE. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	b8dd2a597d	glamor_glyphs: Slightly performance tuning. As glamor_glyphs never fallback, we don't need to keep the underlying glyphs routines, just override the ps->glys Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	3873d412f0	glamor_render: Don't allocate buffer for vbo each time. We can reuse the last one if the last one is big enough to contain current vertext data. In the meantime, Use MapBufferRange instead of MapBuffer. Testing shows, this patch brings some benefit for aa10text/rgb10text. Not too much, but indeed faster. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	682f5d2989	glamor_largepixmap: Walkaround for large texture's upload. I met a problem with large texture (larger than 7000x7000)'s uploading on SNB platform. The map_gtt get back a mapped VA without error, but write to that virtual address triggers BUS error. This work around is to avoid that direct uploading. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	37d4022f01	glamor_render: Optimize the two pass ca rendering. For the componentAlpha with PictOpOver, we use two pass rendering to implement it. Previous implementation call two times the glamor_composite_... independently which is very inefficient. Now we change the control flow, and do the two pass internally and avoid duplicate works. For the x11perf -rgb10text, this optimization can get about 30% improvement. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	21916cf84f	glamor_composite_glyph: Optimize glyphs with non-solid pattern. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	c1bd50d58d	glamor_glyphs: Detect fake or real glyphs overlap. To split a glyph's extent region to three sub-boxes as below. left box 2 x h center box (w-4) x h right box 2 x h Take a simple glyph A as an example: * __* __ **** * * ~~ ~~ The left box and right boxes are both 2 x 2. The center box is 2 x 4. The left box has two bitmaps 0001'b and 0010'b to indicate the real inked area. The right box also has two bitmaps 0010'b and 0001'b. And then we can check the inked area in left and right boxes with previous glyph. If the direction is from left to right, then we need to check the previous right bitmap with current left bitmap. And if we found the center box has overlapped or we overlap with not only the previous glyph, we will treat it as real overlapped and will render the glyphs via mask. If we only intersect with previous glyph on the left/right edge. Then we further compute the real overlapped bits. We set a loose check criteria here, if it has less than two pixel overlapping, we treat it as non-overlapping. With this patch, The aa10text boost fom 1660000 to 320000. Almost double the performance! And the cairo test result is the same as without this patch. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	ea4c22716c	glamor_render: Don't fallback when rendering glyphs with OpOver. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	7acbe89561	glamor_create_pixmap: Allocate glyphs pixmap in memory. As we have glyphs atlas cache, we don't need to hold each glyphs on GPU. And for the subsequent optimization, we need to store the original glyphs pixmap on system memory. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	1e4fc85a71	glamor_fbo: fix a memory leak for large pixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Junyan He	2122e60bf9	Fix a bug for trapezoid clip We find in some cases the trapezoid will be render as a triangle and the left edge and right edge will cross with each other just bellow the top or over the bottom. The distance between the cross poind and the top or bottom is less than pixman_fixed_1_minus_e, so after the fixed converted to int, the cross point has the same value with the top or botton and the triangle should not be affected. But in our clip logic, the cross point will be clipped out. So add a logic to fix this problem. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	6ed418d17b	gles2_largepixmap: force clip for a non-large pixmap. One case we need force clip when download/upload a drm_texture pixmap. Actually, this is only meaningful for testing purpose. As we may set the max_fbo_size to a very small value, but the drm texture may exceed this value but the drm texture pixmap is not largepixmap. This is not a problem with OpenGL. But for GLES2, we may need to call glamor_es2_pixmap_read_prepare to create a temporary fbo to do the color conversion. Then we have to force clip the drm pixmap here to avoid large pixmap handling at glamor_es2_pixmap_read_prepare. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	c41d5c79e7	glamor_emit_composite_vert: Optimize to don't do two times vert coping. We change some macros to put the vert to the vertex buffer directly when we cacluating it. This way, we can get about 4% performance gain. This commit also fixed one RepeatPad bug, when we RepeatPad a not eaxct size fbo. We need to calculate the edge. The edge should be 1.0 - half point, not 1.0. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	8656ddbbe7	glamor_glyphs: Before get upload to cache flush is needed. When we can't get a cache hit and have to evict one cache entry to upload new picture, we need to flush the previous buffer. Otherwise, we may get corrupt glyphs rendering result. This is the reason why user-font-proxy may fail sometimes. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	016995334e	copyarea: Cleanup the error handling logic. Should use ok rather than mixed ok or ret. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	b4a499b7db	trapezoid: Fallback to sw-rasterize for largepixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Junyan He	8f31aed48c	Use the direct render path for A1 Because when mask depth is 1, there is no Anti-Alias at all, in this case, the directly render can work well and it is faseter. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:52 -08:00
Junyan He	fa74a213ad	Add the trapezoid direct render logic We firstly get the render area by clipping the trapezoid with the clip rect, then split the clipped area into small triangles and use the composite logic to generate the result directly. This manner is fast but have the problem that some implementation of GL do not implement the Anti-Alias of triangles fill, so the edge sometimes has sawtooth. It is not acceptable when use trapezoid to approximate circles and wide lines. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:52 -08:00
Junyan He	5f1560c84a	Modilfy the composite logic to two phases We seperate the composite to two phases, firstly to select the shader according to source type and logic op, setting the right parameters. Then we emit the vertex array to generate the dest result. The reason why we do this is that the shader may be used to composite no only rect, trapezoid and triangle render function can also use it to render triangles and polygens. The old function glamor_composite_with_shader do the whole two phases work and can not match the new request. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:52 -08:00
Junyan He	0b0391765f	Add macro of vertex setting for triangle stripe Add macro of vertex setting for triangle stripe draw, and make the code clear. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:52 -08:00
RobinHe	bd180be619	Use shader to generate the temp trapezoid mask The old manner of trapezoid render uses pixman to generate a mask pixmap and upload it to the GPU. This effect the performance. We now use shader to generate the temp trapezoid mask to avoid the uploading of this pixmap. We implement a anti-alias manner in the shader according to pixman, which will caculate the area inside the trapezoid dividing total area for every pixel and assign it to the alpha value of that pixel. The pixman use a int-to-fix manner to approximate but the shader use float, so the result may have some difference. Because the array in the shader has optimization problem, we need to emit the vertex of every trapezoid every time, which will effect the performance a lot. Need to improve it. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:52 -08:00
RobinHe	6dd81c5939	Create the file glamor_triangles.c Create the file glamor_trapezoid.c, extract the logic relating to trapezoid from glamor_render.c to this file. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	bf38ee407b	Enable large pixmap by default. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:52 -08:00
Zhigang Gong	8ca16754f7	largepixmap: Fix the selfcopy issue. If the source and destination are the same pixmap/fbo, and we need to split the copy to small pieces. Then we do need to consider the sequence of the small pieces when the copy area has overlaps. This commit take the reverse/upsidedown into the clipping function, thus it can generate correct sequence and avoid corruption self copying. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	5325c800f7	largepixmap: Support self composite for large pixmap. The simplest way to support large pixmap's self compositing is to just clone a pixmap private data structure, and change the fbo and box to point to the correct postions. Don't need to copy a new box. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	1d2d858b8d	largepixmap: Add transform/repeat/reflect/pad support. This commit implement almost all the needed functions for the large pixmap support. It's almost complete. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	48916a23a9	glamor_getimage: should call miGetimage if failed to get sub-image. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	56d6e7a85f	glamor_putimage: Correct the wrong stride value. We should not use the destination pixmap's devkind as the input image data's stride. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	eb6f981ba4	largepixmap: Enable glamor_composite. Now we start to enable glamor_composite on large pixmap. We need to do a three layer clipping to split the dest/source/mask to small pieces. This commit only support non-transformation and repeat normal case. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	e96ea02010	largepixmap: Implement infrastructure for large pixmap. Added infrastructure for largepixmap, this commit implemented: 1. Create/Destroy large pixmap. 2. Upload/Download large pixmap. 3. Implement basic repeat normal support. 3. tile/fill/copyarea large pixmap get supported. The most complicated part glamor_composite still not implemented. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	ace35e408c	glamor_largepixmap: first commit for large pixmap. This is the first commit to add support for large pixmap. The large here means a pixmap is larger than the texutre's size limitation thus can't fit into one single texutre. The previous implementation will simply fallback to use a in memory pixmap to contain the large pixmap which is very slow in practice. The basic idea here is to use an array of texture to hold the large pixmap. And when we need to get a specific area of the pixmap, we just need to compute/clip the correct region and find the corresponding fbo. We need to implement some auxiliary routines to clip every rendering operations into small pieces which can fit into one texture. The complex part is the transformation/repeat/repeatReflect and repeat pad and their comination. We will support all of them step by step. This commit just add some necessary data structure to represent the large pixmap, and doesn't change any rendering process. This commit doesn't add real large pixmap support. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Junyan He	4c174f4c9c	Fix the problem of x_source and y_source causing radial error The x_source and y_source cause some problem in gradient. The old way to handle it by recaulate P1 P2 to minus the x_source and y_source, but this causes problem in radial shader. Now we modify the manner to set the texture coordinates: (x_source, y_source) --> (x_source + width, y_source + height) to handle all the cases. Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Junyan He	553910d08b	Fix the problem of vertical and horizontal case error in linear gradient. 1. The vertical and horizontal judgement in linear gradient have problem when p1 point and p2 point distance is very small but the gradient pict have a transform matrix which will convert the X Y coordinates to small values. So the judgement is not suitable. Because this judgement's purpose is to assure the divisor not to be zero, so we simply it to enter horizontal judgement when p1 and p2's Y is same. Vertical case is deleted. 2. Delete the unused p1 p2 uniform variable. Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Junyan He	41aa93c393	Fix the problem of set the same stop several times. Some gradient set the stops at the same position, for example: firstly 0.5 to red color and then set 0.5 to blue. This kind of setting will cause the shader work not correctly because the percentage caculating need to use the stop[i] - stop[i-1] as dividend. The previous patch we just kill some stop if the distance between them is 0. But this cause the problem that the color for next stop is wrong. We now modify to handle it in the shader to avoid the 0 as dividend. Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Junyan He	09de37ec1c	Fix a bugy macro definition. The macro like "#define LINEAR_SMALL_STOPS 6 + 2" causes the problem. When use it to define like "GLfloat stop_colors_st[LINEAR_SMALL_STOPS*4];" The array is small than what we supposed it to be. Cause memory corruption problem and cause the bug of render wrong result. Fix it. Signed-off-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Junyan He	d900f553c2	Extract the gradient related code out. 1. Extract the logic of gradient from the glamor_render.c to the file glamor_gradient.c. 2. Modify the logic of gradient pixmap gl draw. Use the logic like composite before, but the gradient always just have one rect to render, so no need to set the VB and EB, replace it with just call glDrawArrays. 3.Kill all the warning in glamor_render.c Reviewed-by: Zhigang Gong<zhigang.gong@linux.intel.com> Signed-off-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	8169280464	glamor_set_destination_pixmap_priv_nc: set drawable's width x height. Previous implementation set the whole fbo's width and height as the viewpoint. This may increase the numerical error as we may only has a partial region as the valid pixmap. So add a new marco pixmap_priv_get_dest_scale to get proper scale factor for the destination pixmap. For the source/mask pixmap, we still need to consider the whole fbo's size. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	7f55e48499	Remove the texture cache code. Caching texture objects is not necessary based on previous testing. To keep the code simple, we remove it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	c5b3c2cedc	Added strict warning flags to CFLAGS. We miss the strict warning flags for a long time, now add it back. This commit also fixed most of the warnings after enable the strict flags. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	6839996b0b	We should not call gradient finalization code if we disable it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	1035fc72b9	Fixed all unused variables warnings. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	33e11cd614	Fixed an uninitialized problem at gradient shader functions. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	c0f75c657f	Fixed one typo bug when fixup a mask picture. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	5c1f15fac2	Added some copyright and author information. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	0d846d9569	Added --enable-debug configuration option. For release version, we disable asserts. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	503f8ec1a6	Remove unecessary header file. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	9dfd10dc75	glamor_render: Fix the repeat none for GLES2. As GLES2 doesn't support clamp to the border, we have to handle it seprately from the normal case. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	9fcd123aed	glamor_blockhandler: Don't do glFinish every time. To do glfinish every time bring some performance overhead. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:51 -08:00
Zhigang Gong	1f83411c9a	glamor_copyarea: Return earlier if have zero nbox. Almost all callers will check whether the regions is empty before call to this internal API, but it seems the glamor_composite_with_copy may call into here with a zero nbox. A little weird, as the miComputeCompositeRegion return a Non-NULL, but the region is empty. Also remove a unecessary glflush. So let's check it here. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	20cbaa61cd	glamor_render: Have to use eaxct size pixmap for transformation. Use partial texture as the pixmap for the transformation source/mask may introduce extra errors. have to use eaxct size. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	6e50ee9c10	glamor_fbo: Added a threshold value for the fbo cache pool. Currently set it to 256MB. If cache pool watermark increases to this value, then don't push any fbo to this pool, will purge the fbo directly. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	540846204c	Fixed a1 bug. It seems that mesa has bugs when uploading bitmap to texture. We switch to convert bitmap to a8 format and then upload the a8 texture. Also added a helper routine to dump 1bpp pixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	9f53cc1c33	glamor_render.c: Fixed repeatPad and repeatRelect. We should use difference calculation for these two repeat mode when we are a sub region within one texture. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	67cf3838e4	gradient: Don't need fixup flag when creating pixmap. Gradient can use a larger texture/fbo directly, don't need an eaxct size texture. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	8a85071edb	glamor_copyarea: Don't access a DRM only pixmap. As EGL image/gbm only support ARGB8888 image, we don't support other format. We may change the way to use gbm directly latter. But now, we have to face this limitation, and thus if a client create a 16bpp drawable, and call get texture from pixmap then a copy to here may occur and thus we have to force retur a TRUE without do nothing. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	0b6867dddb	Disable A8 texture format for GLES2. As PVR's GLES2 implementation doesn't support A8 texture as rendering target, we disable it for now. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	6b664dda69	gradient: Disable gradient for gles2. As PVR glsl compiler seems doesn't support external fragment function, and fails at compile gradient shader. Disable it for now. We may need to modify gradient shader to don't use external function. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Junyan He	686a322c76	Fix the bug caused by gradient picture set the stops at the same percentage. Fix the bug caused by gradient picture set the stops at the same percentage. The (stops[i] - stops[i-1]) will be used as divisor in the shader, which will cause problem. We just keep the later one if stops[i] == stops[i-1]. Signed-off-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Junyan He	3d96929596	Fix the problem of memory leak in gradient pixmap generating. Fix the problem of memory leak in gradient pixmap generating. The problem caused by we do not call glDeleteShader when destroy a shader program. This patch will split the gradient pixmap generating to three category. If nstops < 6, we will use the no array version of the shader, which has the best performance. Else if nstops < 16, we use array version of the shader, which is compiled and linked at screen init stage. Else if nstops > 16, we dynamically create a new shader program, and this program will be cached until bigger nstops. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	05da99106b	glamor_putimage: Optimize for direct uploading and fallback path. This commit optimize two cases: 1. When the clip contains the whole area, we can directly upload the texel data to the pixmap, and don't need to do one extra clipped copy. 2. At fallback path, we don't read back the whole pixmap, just need a sub region. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	ea70ebe0ac	Fixed one potential texture size mismatch problem. There are two cases which we may use a wrong texture size. 1. A pixmap is modified by the client side after it created it. Then the pixmap's width may mismatch the original fbo/tex's size. Thus we need to check this condition when preparing upload the pixmap. 2. We provide two API to download/upload sub region of a textured pixmap. The caller may pass in a larger width then the original pixmap's size, this may happen at putimage and setspans. We need to validate the width and height when do the downloading/uploading. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	08e8c00fe6	glamor_getimage: Don't fallback to miGetImage. As miGetImage is very inefficient, we don't fallback to it. If the format is not ZPixmap, we download the required sub- region, and then call fbGetImage to do the conversion. This way is much faster than previous. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	9bcddff93b	pending_op: Remove the pending operations handling. We have disabled this feature for a long time, and previous testing shows that this(pending fill) will not bring observed performance gain. Now remove it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	1761768f49	glamor_upload_pixmap: Use glTexImage2D for a fully update. Currently, intel's mesa dri driver will not check pbo for a TexSubImage2D. So we use glTexImage2D if we are a fully updating. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	2806f1eace	glamor_setspans: Reuse glamor_upload_sub_pixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	e15bc12074	code clean up. Remove unused variables. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	65c5605c96	glamor_getspans: Reuse glamor_download_sub_pixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	68a5cc6f37	glamor_render: Don't download whole picture when fallback. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	e38eb67532	glamor_put_sub_pixmap: Change to use glamor_upload_sub_pixmap. As the pixmap may be attached to a picture, we need to use glamor_upload_sub_pixmap to process it. glamor_copy_n_to_n will not consider the picture case. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	ff3d2c7963	Fixed a stride problem for textured_drm pixmap. As a textured_drm pixmap has a drm bo attached to it, and it's the DDX layer to set it stride value. In some case, the stride value is not equal to PixmapBytePad(w, depth) which is used within glamor. Then if it is the case, we have two choice, one is to set the GL_PACK_ROW_LENGTH/GL_UNPACK_ROW_LENGTH when we need to download or upload the pixmap. The other option is to change the pixmap's devKind to match the one glamor is using when downloading the pixmap, and restore it to the drm stride after uploading the pixmap. We choose the 2nd option, as GLES doesn't support the first method. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	70b71718e7	glamor_putimage: Reuse copy area to do the clipped copy. If no clip set, we load the bits to the pixmap directly. Otherwise, load the bits to a temporary pixmap and call glamor_copy_area to do the clipped copy. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	e1be714312	Fixed a unbalanced glamor_put_dispatch. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	bd53e24dc3	glamor_pixmap_priv: Always return a valid private pixmap. If a pixmap doesn't have a private, then set its type to GLAMOR_MEMORY, and thus it will get a valid private. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	420af44a3a	Don't need to set GL_PACK_ROW_LENGTH/GL_UNPACK_ROW_LENGTH. We already adjust the stride of the pixmap, and keep the alignment as 4 should be ok to let the GL/GLES match the stride. Previous version has a unbalanced PACK ROW length seting, and is buggy, now fixed it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	18d69fb014	glamor_gl: Use GL_ALPHA for depth 8 pixmap. Use GL_RGBA to represent a8 pixmap is not efficient. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	428f2a3f58	glamor_pixmap_ensure_fbo: Should allocate tex if we don't have one. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:50 -08:00
Zhigang Gong	cf0e206a0f	glamor_polylines: Don't fallback for non-solid fill. As glamor_fill/fbFill will handle non-solid fill correctly. We don't fallback it here. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	b5bd9a2d90	glamor_upload/download: fix 1bpp bug. For A1 to A8's conversion, the stride is different for the source and destination. Previous implementation use the same stride, and may allocate less memory than required. Thus may crash the server when uploading a A1 pixmap. Now fix it. Tested-by: Peng Li <peng.li@intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	b0e91f0f5a	glamor_pixmap_upload_texture: Support to upload a sub region of data. Just as the downloading side, we can upload an sub region data to a pixmap's specified region. The data could be in memory or in a pbo buffer. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	3061f348ca	glamor_getimage: Use glamor_download_sub_pixmap_to_cpu to get image. Reduce the duplicate logic. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	3a91f16912	glamor_polyfillrect: Fixed a potential bug if fallback at glamor_fill. We should advance the prect after we successfully excuted the glamor_fill. And if failed, we need to add the failed 1 box back to nbox. Although, this bug will never happen currently, as glamor_fill will never fallback. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	1f657f72ca	glamor_polyfillrect: Optimize fallback path. Download/upload required region only. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	cea0fe3e1f	fallback_optimize: Prepare for downloading/uploading subregion. Introduced two function glamor_get_sub_pixmap/glamor_put_sub_pixmap, can easily used to get and put sub region of a big textured pixmap. And it can use pbo if possible. To support download a big textured pixmap's sub region to another pixmap's pbo, we introduce a new type of pixmap GLAMOR_MEMORY_MAP. This type of pixmap has a valid devPrivate.ptr pointer, and that pointer points to a pbo mapped address. Now, we are ready to refine those glamor_prepare_access/glamor_finish_access pairs. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	d9dfc3d795	glamor_download_sub_pixmap_to_cpu: New function to download subregion. Prepare to optimize the fallback path. We choose the important rendering pathes to optimzie it by using shader. For other pathes, we have to fallback. We may continue to optimize more pathes in the future, but now we have to face those fallbacks. The original fallback is very slow and will download/upload the whole pixmap. From this commit, I will refine it to just download/upload needed part. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	d96226ac6f	glamor_es2_pixmap_read_prepare: Just prepare the required region. Don't need to prepare the whole source pixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	3dbdd40c6c	glamor_color_convert: Let the caller to provide destination buffer. As we don't need to allocate new buffer when downloading pixmap to CPU, we change the prototype of the color converting function and let the caller to provide the buffer to hold the result. All the color conversion function supports store the result just at the same place of the source. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	4dc6d4e84b	glyphblt/polyops: Use miFunctions by default. Calling to miFunctions give some opportunities to jump to accelerated path, so we switch to call miFunctions rather than fallback to fbFunctions directly.	2013-12-18 11:23:49 -08:00
Zhigang Gong	49e3b44aa8	glamor_set_alu: Added GXclear support at glamor_solid. We don't need to issue the glamor_fallback at the glamor_set_alu routine, as the caller may support GXclear or other most frequent Ops. Leave it to the caller to determine fallback or not. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	3b8b2c77fc	getimage: Enable getimage by default. Fixed one bug when calculate the coords, should consider the drawable's x and y. Now enable it by default. Most of the time, it should be more efficient than miGetImage. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	c6ce44d881	render: Enable more componentAlpha support. Actually only PictOpAtop,PictOpAtopReverse and PictOpXor can't be implemented by using single source blending. All the other can be easily support. Slightly change the code to support them. Consider those three Ops are not frequenly used in real application. We simply fallback them currently. PictOpAtop: smaskdst.a + (1 - s.amask)dst PictOpAtopReverse: smask(1 - dst.a) + dst s.amask PictOpXor: smask(1 - dst.a) + dst * (1 - s.a*mask) The two oprands in the above three ops are all reated to dst and the blend factors are not constant (0 or 1), it's hardly to convert it to single source blend. Now, the rendercheck is runing more smoothly. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	3e9c35bdcb	glamor_set_alu: Fallback for non GXcopy ops with GLES2. As GLES2 doesn't support LogiOps, we have to fallback here. GLES2 programing guide's statement is as below: "In addition, LogicOp is removed as it is very infrequently used by applications and the OpenGL ES working group did not get requests from independent software vendors (ISVs) to support this feature in OpenGL ES 2.0." So, I think, fallback here may not a big deal ;). Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	1a238e89f3	glamor_putimage: Reuse the function in pixmap.c to do the uploading. We reuse glamor_upload_bits_to_pixmap_texture to do the data uploading to texture in putimage. Besides to avoid duplicate code, this also fixed the potential problem when the data format need extra reversion which is not supported by the finish shader, as glamor_upload_bits_to_pixmap_texture will handle all conditions. Tested-by: Junyan He <junyan.he@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	0650c7d4be	gles2: Added 1555/2101010 formats support. Added color conversion code to support 1555/2101010 formats,now gles2 can pass the render check with all formats. We use 5551 to represent 1555, and do the revertion if downloading/uploading is needed. For 2101010, as gles2 doesn't support reading the identical formats. We have to use 8888 to represent, thus we may introduce some accurate problem. But anyway, we can pass the error checking in render check, so that may not be a big problem. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	3add375065	gles2: Fixed color conversion for the formats except 1555 and 2101010. This patch fixed two major problems when we do the color convesion with GLES2. 1. lack of necessary formats in FBO pool. GLES2 has three different possible texture formats, GL_RGBA, GL_BGRA and GL_ALPHA. Previous implementation only has one bucket for all the three formats which may reuse a incorrect texture format when do the cache lookup. After this fix, we can enable fbo safely when running with GLES2. 2. Refine the format matching method in glamor_get_tex_format_type_from_pictformat. If both revertion and swap_rb are needed, for example use GL_RGBA to represent PICT_b8g8r8a8. Then the downloading and uploading should be handled differently. The picture's format is PICT_b8g8r8a8, Then the expecting color layout is as below (little endian): 0 1 2 3 : address a r g b Now the in GLES2 the supported color format is GL_RGBA, type is GL_UNSIGNED_TYPE, then we need to shuffle the fragment color as : frag_color = sample(texture).argb; before we use glReadPixel to get it back. For the uploading process, the shuffle is a revert shuffle. We still use GL_RGBA, GL_UNSIGNED_BYTE to upload the color to a texture, then let's see 0 1 2 3 : address a r g b : correct colors R G B A : GL_RGBA with GL_UNSIGNED_BYTE Now we need to shuffle again, the mapping rule is r = G, g = B, b = A, a = R. Then the uploading shuffle is as below: frag_color = sample(texture).gbar; After this commit, gles2 version can pass render check with all the formats except those 1555/2101010. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	55fdc7b196	glamor_utils: Added debug function to dump depth 15/16 pixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	57e29ebdc1	glamor_render: Disable gradient shader conversion due to bug. I found when enable the gradient shader, the firefox's tab's background has incorrect rendering result. Need furthr investigation, for now, just disable it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	7036cfdd0d	glamor_fbo: Added one macro to disable fbo cache. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	94186db527	glamor_fill: Should restore alu to GXcopy. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Junyan He	1f4486c10b	Add the feature for radial gradient using shader. Add the feature for radial gradient using shader. The transform matrix and the 4 type of repeat mode are supported. Less than 2/255 difference for every color component comparing to pixman's result. Extract the common logic of linear and radial's to another shader. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Junyan He	1026327cdc	Add the feature of generating linear gradient picture by using shader. Add the feature of generating linear gradient picture by using shader. This logic will replace the original linear gradient picture generating manner in glamor which firstly use pixman and then upload it to GPU. Compare it to the result generated by pixman, the difference of each color component of each pixel is normally 0, sometimes 1/255, and 2/255 at most. The pixman use fixed-point but shader use float-point, so may have difference. The feature of transform matrix and 4 types of repeat modes have been supported. The array usage in shader seems slow, so use 8 uniform variables to avoid using array when stops number is not very big. This make code look verbose but the performance improved a lot. We still have slightly performance regression compare to original pixman version. There are one further optimization opportunity which is to merge the gradient pixmap generation and the latter compositing into one shader, then we don't need to generate the extra texture, we can use the gradient value directly at the compositing shader. Hope that can beat pixman version. Will do that latter. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Junyan He	ccf5d7f52b	Prepare for modification of gradient using shader. Prepare for modification of gradient using shader. The gradient pixmaps now is generated by pixman and we will replace them with shader. Add structure fields and dispatch functions which will be needed. Some auxiliary macro for vertex convert. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Junyan He	a57bf66d49	glamor_utils: Add some assistant functions to compare pixmaps/pictures. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Junyan He	cd75e85ff3	Fixup For list.h change in xorg Because the file list.h in xorg/include has changed the functions and struct names, adding xorg_ prefix before the original name. So Modify glamor_screen_private struct and the code which use list's functions in glamor_fbo.c. We hack at glamor_priv.h avoid the compile error when using old version xserver header file. Signed-off-by: Junyan He <junyan.he@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	213285f2b8	For DRI swap buffers. This commit added two APIs to support the DRI swap buffer. one is glamor_egl_exchange_buffers() which can swap two pixmaps' underlying KHRimages/fbos/texs. The DDX layer should exchange the DRM bos to make them consistent to each other. Another API is glamor_egl_create_textured_screen_ext(), which extent one more parameters to track the DDX layer's back pixmap pointer. This is for the triple buffer support. When using triple buffer, the DDX layer will keep a back pixmap rather then the front pixmap and the pixmap used by the DRI2 client. And during the closing screen stage, we have to dereference all the back pixmap's glamor resources. Thus we have to extent this API to register it when create new screen. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:49 -08:00
Zhigang Gong	8012b030c3	glamor_copyarea: Don't use GL_CLAMP_TO_BORDER when GLES2 enabled. We may need to modify all the shader to handle GL_CLAMP_TO_BORDER when using GLES2. XXX, for now, we just ignore them. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	5ccf721d38	glamor_fbo: Fix a bug when create No gl FBO pixmap. Need to get format before goto create a new fbo. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	ce634e84d4	glamor_render: Only recalculate texture for repeat case. Slightly optimize the fragment shader, as if we are not repeat case and not exceed the valid texture range, then we don't need to recalculate the coords. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	53387728dd	glamor_tile/composite: Modify fs to re-calculate texture coords. Then we don't need to fixup the larger pixmap to the exact size, just need to let the shader to re-calculate the correct texture coords. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Chris Wilson	556adfa6b9	Fixup glx support Renaming glamor_priv->dispatch and wrapping the access to the dispatch table with a function that also ensured the context was bound. dispatch = glamor_get_dispatch(glamor_priv); ... glamor_put_dispatch(glamor_priv); So that we catch all places where we attempt to call into GL withouta context. As an optimisation we can then do glamor_get_context(); glamor_put_context() around the rendering entry points to reduce the frequency of having to restore the old context. (Along with allowing the context to be recursively acquired and making the old context part of the glamor_egl state.) Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	430bc16ca0	GLX: Enable glx support. If we are using MESA as our GL library, then both xserver's GLX and glamor are link to the same library. As xserver's GLX has its own _glapi_get/set_context/dispatch etc, and it is a simplified version derived from mesa thus is not sufficient for mesa/egl's dri loader which is used by glamor. Then if glx module is loaded before glamoregl module, the initialization of mesa/egl/opengl will not be correct, and will fail at a very early stage, most likely fail to map the element buffer. Two methodis to fix this problem, first is to modify the xserver's glx's glapi.c to fit mesa's requirement. The second is to put a glamor.conf as below, to the system's xorg.conf path. Section "Module" Load "glamoregl" EndSection Then glamor will be loaded firstly, and the mesa's libglapi.so will be used. As current xserver's dispatch table is the same as mesa's, then the glx's dri loader can work without problem. We took the second method as it don't need any change to xorg.:) Although this is not a graceful implementation as it depends on the xserver's dispatch table and the mesa's dispatch table is the same and the context set and get is using the same method. Anyway it works. As by default, xserver will enable GLX_USE_TLS. But mesa will not enable it, you may need to enable that when build mesa. Three pre-requirements to make this glamor version work: 0. Make sure xserver has commit 66e603, if not please pull the latest master branch. 1. Rebuild mesa by enable GLX_USE_TLS. 2. Put the glamor.conf to your system's xorg.conf path and make sure it loaded prior to glx module. Preliminary testing shows indirect glxgears works fine. If user want to use GLES2 for glamor by using MESA, GLX will not work correctly. If you are not using normal MESA, for example PVR's private GLES implementation, then it should be ok to use GLES2 glamor and the GLX should work as expected. In this commit, I use gbm to check whether we are using MESA or non-mesa. Maybe not the best way. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	0a8fb8563f	glamor_pixmap: Should bind unpack buffer to 0 after the uploading. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	e03ad27dfb	glamor_picture: Fix the wrong order of header file. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	bf24c2c068	glamor_dump_pixmap: Add helper routine to dump pixmap. For debug purpose only to dump the pixmap's content. As this function will call glamor_prepare_access/glamor_finish_access internally. Please use it carefully. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	91891b3711	glamor_fill/tile: Fixed a tileX/tileY calculation bug. The previous's calculation is incorrect, now fix it and then we don't need to fallback at glamor_tile. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com> Tested-by: Peng Li <peng.li@intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	39d9e6c693	prepare_access: Don't use fbo after it was downloaded. We add a new gl_fbo status GLAMOR_FBO_DOWNLOADED to indicate the fbo was already downloaded to CPU. Then latter the access to this pixmap will be treated as pure CPU access. In glamor, if we fallback to DDX/fbXXX, then we fallback everything currently. We don't support to jump into glamor acceleration layer between a prepare_access/finish_access. Actually, fbCopyPlane is such a function which may call to acceleration function within it. Then we must mark the downloaded pixmap to another state rather than a normal fbo textured pixmap, and then stick to use it as a in-memory pixmap. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com> Tested-by: Peng Li <peng.li@intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	1817b6c0cf	glamor_eglmodule: Change module name according to normalize naming rule. As Xorg module loader will normalize module name which will remove '_' when we put "glamor_egl" to the configure file, then it will fail to find us. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	1ab4002874	Don't call dixSetPrivate directly. We may change the way to set/get those private data latter. consolidate to glamor_set_pixmap/screen_private is better than call those dixSetPrivate directly. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	bf7d79dc0a	Refine CloseScreen and FreeScreen processes. This commit move the calling to glamor_close_screen from glamor_egl_free_screen to glamor_egl_close_screen, as this is the right place to do this. We should detach screen fbo and destroy the corresponding KHR image at glamor_egl_close_screen stage. As latter DDX driver will call DestroyPixmap to destroy screen pixmap, if the fbo and image are still there but glamor screen private data pointer has been freed, then it causes segfault. This commit also introduces a new flag GLAMOR_USE_EGL_SCREEN. if DDX driver is using EGL layer then should set this bit when call to glamor_init and then both glamor_close_screen and glamor_egl_close_screen will be registered correctly, DDX layer will not need to call these two functions manually. This way is also the preferred method within Xorg domain. As interfaces changed, bump the version to 0.3.0. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com> Tested-by: Peng Li <peng.li@intel.com>	2013-12-18 11:23:48 -08:00
Chris Wilson	97efbd25fe	Use CLAMP_TO_BORDER in copy_n_to_n so we can sample outside of the source In order to reduce a composite operation to a source, we need to provide Render semantics for the pixel values of samples outside of the source pixmap, i.e. they need to be rgba(0, 0, 0, 0). This is provided by using the CLAMP_TO_BORDER repeat mode, but only if the texture has an alpha channel. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Chris Wilson	864153bb9e	Do not reduce a composite to a copy if we need to sample outside of the source In order to maintain Render semantics, samples outside of the source should return CLEAR. The copy routines instead are based on the core protocol and expects the source rectangle to be wholly contained within the drawable and so does no fixup. Fixes the rendering of GTK icons. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	566cca59e1	glamor-gles2: Fixup the pixmap before read back if it is not readable. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	36ac9b7191	glamor-fbo: Tweek the cache bucket calculation. And also reduce the expire count to 100 which should be good enough on x11perf and cairo-trace testing. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	a1de528c56	glamor_create_fbo: Concentrate the fbo size/depth checking. Concentrate checking the size/depth when creating fbo. Simply the pixmap creation and the uploading fbo/texture preparing. Also slightly change the uploading fbo's preparation. If don't need fbo, then a fbo only has valid texture should be enough to return. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	1bfe595711	glamor-pixmap-upload: Create a uploading fbo with a texture only. Just an initial implementation and disabled by default. When uploading a pixmap to a texture, we don't really want to attach the texture to any fbo. So add one fbo type which doesn't has a gl FBO attached to it. This commit can increase the cairo-trace's performance by 10-20%. Now the firefox-planet-gnome is 8.3s. SNA is still the best, only take 3.5s. Thanks for Chris to point out the A1 pixmap uploading bug. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Li Peng	6fb92e67b3	glamor: check driver support GEM or not glamor calls DRM_IOCTL_GEM_FLINK to get a name for a buffer object. It only works for driver that support GEM, such as intel i915 driver. But for pvr driver who doesn't has GEM, we can't do it this way. According to Chris's comments, we check the has_gem as the following method: Here we just try to flink handle 0. If that fails with ENODEV or ENOTTY instead of ENOENT (or EINVAL on older kernels), set has_gem=0. Signed-off-by: Li Peng <peng.li@intel.com> Reviewed-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	68789b23e7	glamor_gles2: Consolidate gles2 pixmap format readable check to one function. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	3373d2c696	glamor_egl: Add support for the platform doesn't have gbm. Maybe we should use eglGetDisplayDRM to get display, but current PVR's driver is using eglGetDisplay. Signed-off-by: Peng Li <peng.li@intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	92671e3ac8	glamor_egl: Don't call eglDestroyImageKHR directly. Some implementation doesn't have it. Signed-off-by: Peng Li <peng.li@intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Peng Li	125f317d90	glamor_gl_dispatch: fix the dispatch initialization on GLES2. Some gles2 implementation doesn's support get_proc_address. And we also need to avoid get those missing functions pointers when we are GLES2. Signed-off-by: Peng Li <peng.li@intel.com> Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	64fef665c9	glamor_render: Add non-Map/Unmap vertex array for GLES. As some GLES implementations' glMapOES /glUnmapOES is not so efficient, we implement the in memory vertex array for them. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	c244969b33	glamor_init: Should set gl_flavor before sub-module intialization. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:48 -08:00
Zhigang Gong	62e5365351	glamor_composite: Fix one bug when we have too more vertices. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	9c6fd931a6	glamor-fbo-pool: Enable to reuse different size fbo/texture. Fixup three special cases, one is in tile and the other is in composite. Both cases are due to repeat texture issue. Maybe we can refine the shader to recalculate texture coords to support partial texture's repeating. The third is when upload a memory pixmap to texture, as now the texture may not have the exact size as the pixmap, we should not use the full rect coords. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	c7e79d6acf	glamor-fbo-pool: Implement fbo cache mechanism. We classify the cache according to the texture's format/width/height. As openGL doesn't allow us to change a texture's format/width/height after the internal texture object is already allocated, we can't just calculate the size and then according ths size to put the fbo to an bucket which is just like SNA does. We can only put the fbo to the corresponding format/width/height bucket. This commit only support the exact size match. The following patch will remove this restriction, just need to handle the repeat/tile case when the size is not exactly match. Should use fls instead of ffs when decide the width/height bucket, thanks for Chris to point this out. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	2ff4100849	glamor_fbo: Introduce glamor fbo to manage all the fb/tex. This is the first patch to implement a fbo/tex pool mechanism which is like the sna's BO cache list. We firstly need to decopule the fbo/tex from each pixmap. The new glamor_pixmap_fbo data structure is for that purpose. It's somehow independent to each pixmap and can be reused latter by other pixmaps once it's detached from the current pixmap. And this commit also slightly change the way to create a memory pixmap. We will not create a pixmap private data structure by default, instead we will crete that structure when a memory pixmap is attaching a fbo to it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	ca2ddd33a1	glamor_set_pixmap_texture/screen_pixmap: Remove useless parameters. As after we got a texture, no matter the texture is created on the glamor_create_pixmap or on the egl layer, we all already know the texture's width and height there. We don't need to pass them in. This commit also simply the glamor_egl_create_textured_screen to reuse the egl_create_textured_pixmap. And also remove the useless root image from the egl private structure. As now the root image is bound to the screen image, we don't take care it separately here. It will be freed at the screen closing. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	15166bba97	Add debug message for all the uploading path. The previous message missed the texture restoring path. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	994a9ff7f5	glamor_create_picture: Fix the format matching method. We should not simply set a TEXTURE_DRM pixmap to a separated texture pixmap. If the format is compatible with current fbo then it is just fine to keep it as TEXTURE_DRM type and we can safely fallback to DDX layer on it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	28fcd7cd01	Rearrange data structure and remove unused fileds. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	bdd72da0c5	Release previous textre/fb when bind a new texture to a pixmap. As we want to support DRI2 drawable which may create a new textured_drm to a pre-existing texture_only pixmap, we have to add this logical. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com> Tested-by: He Junyan<junyan.he@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	d7352d57b9	Add glFinish after glFlush. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	8b943ce203	Set glamor's initial version to 0.2.0. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	7329414bf2	Silence a compilation warning. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	069a6d1746	glamor_composite: Allocate VBO on demand. Use a fixed VBO is not efficient. Some times we may only has less than 100 verts, and some times we may have larger than 4K verts. We change it to allocate VBO buffer dynamically, and this can bring about 10% performance gain for both aa10text/rgb10text and some cairo benchmarks. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	42a0261cb3	glamor_getimage: Add the optimization path of getImage. This optimization will only call glReadPixels once. It should get some performance gain. But it seems it even get worse performance at SNB, disable it by default. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	96085017c8	Consolidate the choose of internal texture format to one function. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	a74596be0e	Set filter to GL_NEAREST by default. This is the fastest filter and let's use it by default. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	4cd07871a4	glamor-composite: Use glDrawElements to reduce the count of vertices. To split a rectangle (0,1,2,3) to two separated triangles need to feed 6 vertices, (0,1,2) and (0,2,3). use glDrawElements can reuse the shared vertices. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	9dafd6fce5	glamor-composite: Optimize the computation of composite region. Computing the composite region at the composite_with_shader is very inefficient. As when we call to here from the glamor_glyph's temproary picture, we don't need to compute this region at all. So we move this computing out from this function and do that at the glamor_composite function. This can get about 5% performance gain for aa10text/rgb10text. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	2511a00cdd	Fixed a configure bug. Should check the enable-glamor-gles2 before use the variable. And should include the config.h as the GLAMOR_GLES2 macro is defined there. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	a65e1c736a	Reduce the double check of pixmap's private pointer. As we now add the checking to the Macro, we don't need to check the pointer outside the Macro. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	e1789893e5	get_spans: Check whether have a valid fbo before check format. If a pixmap is a pure in-memory pixmap, we do not need to check its format. Format checking has more overhead than checking FBO, so we change to check fbo firtly. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	057f52a04d	Track all picture's drawable pict format. Even if a picture's pixmap is a pure in memory pixmap, we still need to track its format. The reason is we may need to upload this drawable to texture thus we need to know its real picture format. As to the MACRO to check whether a pixmap is a picture, we should check whether the priv is non-NULL before we touch its field. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	8a4758a358	Need to check pixmap priv before touch its field. As now the pixmap may be allocated by DDX and doesn't have a valid pixmap private field. We must check pixmap private pointer before touch its field value. If a pixmap doesn't have a non-NULL private pointer, it doesn't have a valid FBO. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	9264335347	Added more drawing functions. As we want to take over all the possible GC ops from the DDX layer, we need to add all the missed functions. This commit also fixed one bug at polylines. We simply drop the bugy optimized code now, as it did not consider of clip info. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	d42eb04c29	Remove useless output messages. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	fbccc4bbbc	Fixed a rendering bug at fillspans. We should not change the points coords when loop for the clip rects. Change to use another variable to store the clipped coords and keep the original coords. This bug cause some XTS failures. Now fix it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00
Zhigang Gong	70b6341538	Fixed a bug at putImage. fbPutImage wants the input drawable is the target drawable rather than the backing pixmap. This bug cause some XTS failures, now fix it. Signed-off-by: Zhigang Gong <zhigang.gong@linux.intel.com>	2013-12-18 11:23:47 -08:00

... 2 3 4 5 6 ...

564 Commits