vello

alex/vello

mirror of https://github.com/italicsjenga/vello.git synced 2025-01-09 20:31:29 +11:00

Author	SHA1	Message	Date
Elias Naur	a5b6bda941	add support for element flags to shaders Commit `9afa9b86b6` added Rust support for encoding flags into elements. This change adds support to shaders by introducing variant tag structs: struct VariantTag { uint tag; uint flags; } and returning them from Variant_tag functions. It also adds a flags argument to write functions for enum variants that include TagFlags. No functionality changes. Updates #70 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:50:12 +01:00
Elias Naur	903ab1fb59	implement FillImage command and sRGB support FillImage is like Fill, except that it takes its color from one or more image atlases. kernel4 uses a single image for non-Vulkan hosts, and the dynamic sized array of image descriptors on Vulkan. A previous version of this commit used textures. I think images are a better choice for piet-gpu, for several reasons: - Texture sampling, in particular textureGrad, is slow on lower spec devices such as Google Pixel. Texture sampling is particularly slow and difficult to implement for CPU fallbacks. - Texture sampling need more parameters, in particular the full u,v transformation matrix, leading to a large increase in the command size. Since all commands use the same size, that memory penalty is paid by all scenes, not just scenes with textures. - It is unlikely that piet-gpu will support every kind of fill for every client, because each kind must be added to kernel4. With FillImage, a client will prepare the image(s) in separate shader stages, sampling and applying transformations and special effects as needed. Textures that align with the output pixel grid can be used directly, without pre-processing. Note that the pre-processing step can run concurrently with the piet-gpu pipeline; Only the last stage, kernel4, needs the images. Pre-processing most likely uses fixed function vertex/fragment programs, which on some GPUs may run in parallel with piet-gpu's compute programs. While here, fix a few validation errors: - Explicitly enable EXT_descriptor_indexing, KHR_maintenance3, KHR_get_physical_device_properties2. - Specify a vkDescriptorSetVariableDescriptorCountAllocateInfo for vkAllocateDescriptorSets. Otherwise, variable image2D arrays won't work (but sampler2D arrays do, at least on my setup). Updates #38 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:50:12 +01:00
Elias Naur	07e07c7544	ensure consistent path segment transformation As described in #62, the non-deterministic scene monoid may result in slightly different transformations for path segments in an otherwise closed path. This change ensures consistent transformation across paths in three steps. First, absolute transformations computed by the scene monoid is stored along with path segments and annotated elements. Second, elements.comp no longer transforms path segments. Instead, each segment is stored untransformed along with a reference to its absolute transformation. Finally, path_coarse performs the transformation of path segments. Because all segments in a path share a single transformation reference, the inconsistency in #62 is avoided. Fixes #62 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:45:23 +01:00
Elias Naur	ad444f615c	elements.comp: use shared array of structs directly The NVIDIA shader compiler bug that forced splitting of the state struct into primitive types is now fixed. Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:45:23 +01:00
Elias Naur	fd746ea7a6	name and comment magic constant Follow-up to review of PR #61. Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:45:23 +01:00
Elias Naur	eaa714644e	correct signature generation of non-memory enum writer Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:45:23 +01:00
Elias Naur	79d722df48	remove unused commands from pathseg Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:45:23 +01:00
Tatsuyuki Ishi	3557df24c5	Merge pull request #73 from ishitatsuyuki/close-path Always close fill paths, fix #68 Close #68 Close #74	2021-03-17 01:24:09 +09:00
Ishi Tatsuyuki	8a499bc50e	Always close fill paths, fix #68	2021-03-17 01:16:00 +09:00
Raph Levien	22e410c13e	Merge pull request #69 from linebender/encode_tag_flags Add encoding of flags into tag	2021-03-09 07:34:47 -08:00
Raph Levien	9afa9b86b6	Add encoding of flags into tag Add a `TagFlags` type. You can write enum variants as `Variant(TagFlags, Content)` and this will encode a u16 into the top 16 bits of the tag word.	2021-03-02 11:56:02 -08:00
Elias Naur	b73eabf4eb	kernel4.comp: remove unused commands Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-02-24 15:32:24 +01:00
Elias Naur	6a4e26ef2a	all: add optional memory checks Defining MEM_DEBUG in mem.h will add a size field to Alloc and enable bounds and alignment checks for every memory read and write. Notes: - Deriving an Alloc from Path.tiles is unsound, but it's more trouble to convert Path.tiles from TileRef to a variable sized Alloc. - elements.comp note that "We should be able to use an array of structs but the NV shader compiler doesn't seem to like it". If that's still relevant, does the shared arrays of Allocs work? Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-02-15 16:07:45 +01:00
Elias Naur	ee67a0a515	kernel4: simplify a tiny bit Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	716517cc04	coarse,binning: organize bins into width_in_bins x height_in_bins The binning shader supports up to N_TILE bins. To efficiently cover wide or tall viewports, convert the rigid N_TILE_X x N_TILE_Y bin layout to a variable width_in_bins x height_in_bins layout. Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	ef4ec772ad	backdrop: repair unsound optimization Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	8b62022749	backdrop: avoid a (benign) zero-sized read Found with MEM_DEBUG added in later change. Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	c4f5a69a0d	implement variable output sizing Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	c67696714b	coarse.comp: don't write Cmd_End to tiles out of bounds If WIDTH_IN_TILES or HEIGHT_IN_TILES are not divisible by N_TILE_X or N_TILE_Y respectively, the previously unconditional Cmd_End_write would write out of bounds. Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	4de67d9081	unify GPU memory management Merge all static and dynamic buffers to just one, "memory". Add a malloc function for dynamic allocations. Unify static allocation offsets into a "config" buffer containing scene setup (number of paths, number of path segments), as well as the memory offsets of the static allocations. Finally, set an overflow flag when an allocation fail, and make sure to exit shader execution as soon as that triggers. Add checks before beginning execution in case the client wants to run two or more shaders before checking the flag. The "state" buffer is left alone because it needs zero'ing and because it is accessed with the "volatile" keyword. Fixes #40 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	a2a2d12c5d	path_coarse.comp: fix intersection inconsistencies, take 2 The previous attempt to fix inconsistent intersections because of floating point inaccuracy[0] missed two cases. The first case is that for top intersections with the very first row would fail the test tag == PathSeg_FillCubic && y > y0 && xbackdrop < bbox.z In particular, y is not larger than y0 when y0 has been clipped to 0. Fix that by re-introducing the min(p0.y, p1.y) < tile_y0 check that does work and is just as consistent. Add similar check, min(p0.x, p1.x) < tile_x0, for deciding when to clip the segment to the left edge (but keep consistent xray check for deciding left edge intersections). The second case is that the tracking left intersections in the [xray, next_xray] range of tiles may fail when next_xray is forced to last_xray, the final xray value. Fix that case by computing next_xray explicitly, before looping over the x tiles. The code is now much simpler. Finally, ensure that xx0 and xx1 doesn't overflow the allocated number of tiles by clamping them after setting them. Adjust xx0 to include xray, just as xx1 is adjusted; I haven't seen corruption without it, but it's not obvious xx0 always includes xray. While here, replace a "+=" on a guaranteed zero value to just "=". Updates #23 [0] `29cfb8b63e` Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-27 20:24:29 +01:00
Elias Naur	14e922e17e	Merge pull request #56 from eliasnaur/master add SPDX license headers	2020-12-12 01:06:33 +01:00
Elias Naur	d21f2b68de	all: add SPDX license headers Fixes #53 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-11 18:24:35 +01:00
Raph Levien	3017bc41d4	Merge pull request #57 from linebender/docs_update Minor tweaks to vision doc	2020-12-11 09:15:00 -08:00
Raph Levien	ee770bc34d	Minor tweaks to vision doc	2020-12-11 07:31:21 -08:00
Elias Naur	5c04e4882b	remove unused tilegroup.h and extra spaces from kernel4.comp Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-11 15:00:58 +01:00
Elias Naur	59fc612c19	Merge pull request #55 from eliasnaur/master elements.comp: tighten state size calculations	2020-12-11 10:33:53 +01:00
Raph Levien	a711310917	Merge pull request #54 from linebender/docs Add vision document	2020-12-10 10:43:32 -08:00
Raph Levien	7c7417c9c6	Link issues	2020-12-10 10:04:18 -08:00
Elias Naur	580b63e558	elements.comp: tighten state size calculations The state header is only one word (flags), not two. Move the partition atomic counter to a separate field instead of state[0], simplifying state offset calculations. Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-10 18:48:16 +01:00
Raph Levien	078440563c	Link Minikin	2020-12-10 09:21:07 -08:00
Raph Levien	a34b2ce648	Fix minor formatting glitches	2020-12-10 09:19:48 -08:00
Raph Levien	abf8aeef1f	Add vision document	2020-12-10 09:15:50 -08:00
Raph Levien	56aaf7c19a	Merge pull request #52 from linebender/fix_query_pool Fetch correct query pool	2020-12-09 22:42:23 -08:00
Raph Levien	769d71915e	Fetch correct query pool It should fetch the last query pool, and was off by one. That worked on my machine (windows), but did cause panics.	2020-12-09 08:34:01 -08:00
Elias Naur	468a7b8054	Merge pull request #51 from eliasnaur/master remove unused BinChunk type	2020-12-08 00:57:45 +01:00
Elias Naur	1c6ca7e5fb	remove unused BinChunk type Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-08 00:45:08 +01:00
Raph Levien	2eecaa8f0b	Merge pull request #46 from linebender/image_work Progress towards images	2020-12-02 12:35:06 -08:00
Raph Levien	634530fb91	Merge branch 'master' into image_work	2020-12-02 11:58:45 -08:00
Raph Levien	3906f348fd	Merge pull request #47 from linebender/clip_opt Optimize clips	2020-12-02 11:57:14 -08:00
Elias Naur	e582f6b388	Merge pull request #50 from eliasnaur/master Avoid inconsistent line-tile intersections	2020-12-02 20:03:10 +01:00
Elias Naur	29cfb8b63e	eliminate inconsistent line intersections from path_coarse.comp The finite precision of floating point computations can lead the coarse renderer into inconsistent tile intersections, which implies impossible line segments such as lines with gaps or double intersections. The winding number algorithm is sensitive to these errors which show up as incorrectly filled paths. This change forces all intersections to be consistent. First, the floating point top edge intersection test is removed; top edge intersections are completely determined by left edge intersections. Then, left edge intersections are inserted from the tile with the last top edge intersection. The next top edge is then fixed to be the last tile with a left edge intersection. More details in the patch comments. Fixes #23 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-01 18:35:29 +01:00
Elias Naur	19f4d9fa95	change tile segment representation to (origin, vector) Eliminates the precision loss of the subtraction in the sign(end.x - start.x) expression in kernel4. That's important for the next change that avoids inconsistent line intersections in path_coarse. Updates #23 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-01 18:34:40 +01:00
Elias Naur	2068171f96	path_coarse.comp: tighten variable scopes, delete unused variables No functional changes. Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-12-01 18:01:04 +01:00
Raph Levien	97dcb5122e	Merge branch 'master' into image_work	2020-11-29 17:09:48 -08:00
Raph Levien	b8ea1e35cf	Merge branch 'master' into clip_opt	2020-11-29 17:07:46 -08:00
Elias Naur	c50d3f17ea	Merge pull request #48 from eliasnaur/master Remove FillMask, unused types	2020-11-30 01:10:50 +01:00
Elias Naur	feeb459fa1	remove FillMask and FillMaskInv Obsoleted by BeginClip/EndClip. Updates #36 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-11-29 16:59:58 +01:00
Elias Naur	bd450ef461	piet-gpu-types: remove unused Segment and SegChunk types Signed-off-by: Elias Naur <mail@eliasnaur.com>	2020-11-29 16:51:35 +01:00
Raph Levien	4138f8a516	Optimize clips Optimize tiles with clip masks that are all-zero or all-one. Part of #36	2020-11-27 09:30:35 -08:00

... 12 13 14 15 16 ...

853 commits