vello

alex/vello

mirror of https://github.com/italicsjenga/vello.git synced 2025-01-11 04:51:32 +11:00

Author	SHA1	Message	Date
Raph Levien	95081971fe	Add HAL method to update buffer descriptor This is WIP because only the Metal implementation is added. Part of the work for #175	2022-06-23 12:05:09 -07:00
Chad Brokaw	60bca997e5	Remove deprecated functions * remove CmdBuff::dispatch() which was moved to ComputePass * remove CmdBuff::write_timestamp() which is replaced by timestamp index pair in ComputePassDescriptor	2022-05-04 01:59:49 -04:00
Chad Brokaw	41316ccd4b	Add debug label functions to ComputePass	2022-05-04 01:56:16 -04:00
Raph Levien	ea0fbab8aa	Add memory barrier to ComputePass We need to be able to call memory_barrier() on ComputePass, to avoid the borrow checker complaining if we tried to call it on the underlying command buffer.	2022-04-20 15:13:54 -07:00
Raph Levien	58836244a4	Change signature of ComputePass::end() to self As per review comment.	2022-04-20 09:19:22 -07:00
Raph Levien	9980c858b6	Fix timer queries in Vulkan and DX12 backends Current status: the piet-gpu-hal module (including the collatz example) have the new API (with queries set on compute pass) implemented. The other uses have not yet been updated. On Metal, only M1 is tested. The "command" counter style is partly implemented, but not fully wired up.	2022-04-14 17:17:33 -07:00
Raph Levien	ba2b27cc3c	Rework of compute encoder abstraction The current plan is to more or less follow the wgpu/wgpu-hal approach. In the mux/backend layer (which corresponds fairly strongly to wgpu-hal), there isn't explicit construction of a compute encoder, but there are new methods for beginning and ending a compute pass. At the hub layer (which corresponds to wgpu) there will be a ComputeEncoder object. That said, there will be some differences. The WebGPU "end" method on a compute encoder is implemented in wgpu as Drop, and that is not ideal. Also, the wgpu-hal approach to timer queries (still based on write_timestamp) is not up to the task of Metal timer queries, where the query offsets have to be specified at compute encoder creation. That's why there are different projects :) WIP: current state is that stage-style queries work on Apple Silicon, but non-Metal backends are broken, and piet-gpu is not yet updated to use new API.	2022-04-14 10:19:28 -07:00
Raph Levien	012d679e3d	Merge branch 'master' into mtl_guest	2022-02-07 17:28:52 -08:00
Raph Levien	b2e7c80d3b	Fix non-mac builds Also updates comment. We know the implementation is incomplete and needs refinement, but it seems useful to commit as a starting point for further work.	2022-02-07 13:54:24 -08:00
Tatsuyuki Ishi	b5ffecba17	piet-gpu, hal: Add Vulkan debug markers Other backends are left as stub for now. Makes it easier to identify stages within Radeon GPU Profiler and others.	2022-01-21 11:34:34 +09:00
Raph Levien	0cf370f9c7	Mostly working rendering This exposes interfaces to render glyphs into a texture atlas. The main changes are: * Methods to plumb raw Metal GPU resources (device, texture, etc) into piet-gpu-hal objects. * A new glyph_render API specialized to rendering glyphs. This is basically the same as just painting to a canvas, but will allow better caching (and has more direct access to fonts, bypassing the Piet font type which is underdeveloped). * Ability to render to A8 target in addition to RGBA. WIP, there are some rough edges, not least of which is that the image format changes are only on mac and cause compile errors elsewhere.	2022-01-19 12:10:51 -08:00
Raph Levien	833d993a4e	More progress exposing interface Much of the surface area exists for rendering now. WIP of course still	2022-01-18 18:41:28 -08:00
Raph Levien	178761dcb3	Path stream processing This patch contains the core of the path stream processing, though some integration bits are missing. The core logic is tested, though combinations of path types, transforms, and line widths are not (yet). Progress towards #119	2021-12-01 07:33:24 -08:00
Raph Levien	9fb2ae91eb	Access buffer data through mapping This patch includes a number of changes to encourage reading and writing buffers through mapping rather than copying data as before. This includes a new `BufWrite` abstraction which is designed for filling buffers. It behaves much like a Vec<u8>, but with fixed capacity.	2021-11-25 21:27:08 -08:00
Raph Levien	2ebdd942cf	Use bytemuck Get rid of `PlainData` trait and use `Pod` from bytemuck instead.	2021-11-23 08:24:16 -08:00
Raph Levien	657f219ce8	Better DX12 descriptor management Reduce allocation of descriptor heaps. This change also enables clearing of buffers, as the handles are needed at command dispatch time. Also updates the tests to use clear_buffers on DX12. Looking forward to being able to get rid of the compute shader workaround on Metal. This is a followup on #125, and progress toward #95	2021-11-20 16:36:43 -08:00
Raph Levien	f9d0aa078b	Use DXIL shader compilation Integrate DXC for translating HLSL for use in DX12. This will work around FXC limitations and unlock the use of more advanced HLSL features such as subgroups. This hardcodes the use of DXIL, but it could be adapted (with a bit of effort) to choose between DXIL and HLSL at runtime.	2021-11-11 12:55:10 -08:00
Raph Levien	94949a6906	Mac port of bind layout rework This gets it working on mac. Also delete old implementation. There's also an update to winit 0.25 in here, because it was easier to roll forward than fix inconsistent Cargo.lock. At some point, we should systematically update all deps.	2021-11-10 13:40:16 -08:00
Raph Levien	74f2b4fd1c	Rework bind layout Use an array of bindtypes rather than the previous situation, which was a choice of buffer counts, or a heavier builder pattern. The main thing this unlocks is distinguishing between readonly and read/write buffers, which is important for DX12. This is WIP, the Metal part hasn't been done, and the old stuff not deleted. Part of #125	2021-11-10 11:25:16 -08:00
Raph Levien	59e850a7b1	Reuse command buffers Reuse submitted command buffers rather than continually allocating them. This patch also improves the story across the different backends. On DX12 it was reusing allocators without resetting them, which could be a leak. And on Metal the reset "fails," so there's always a new alloc.	2021-10-23 09:21:19 -07:00
Raph Levien	b423e6c25d	Warning cleanup This patch gets rid of warnings and runs cargo fmt. A lot of the warnings were unused items (especially in DX12 land). At some point we might want to bring some of that back, at which point it might be useful to refer to what was deleted in this commit.	2021-10-23 09:21:19 -07:00
Raph Levien	f73da22a69	Clean up command buffers This patch deallocates command buffers after command submission completes (the same time as other resources are released). It should be portable and robust on all back-ends, but not necessarily the most efficient. But reuse of command buffers, as well as more efficient allocation on Vulkan and DX12, are for followup work.	2021-10-19 17:17:41 -07:00
Raph Levien	056446c23d	Cargo fmt	2021-09-06 10:19:55 -07:00
Raph Levien	4b2a720289	Animating scene Make the scene dependent on timing. This commit patches the HAL to reuse command buffers; this works well on Vulkan and prevents a leak, but breaks the other back-ends. That will require a solution, possibly including plumbing up the resource lifetime responsibilities to the client. Other things might be hacky as well.	2021-09-06 10:18:14 -07:00
Raph Levien	05e81acebc	Basically get gradients working Separate out render context upload from renderer creation. Upload ramps to GPU buffer. Encode gradients to scene description. Fix a number of bugs in uploading and processing. This renders gradients in a test image, but has some shortcomings. For one, staging buffers need to be applied for a couple things (they're just host mapped for now). Also, the interaction between sRGB and premultiplied alpha isn't quite right. The size of the gradient ramp buffer is fixed and should be dynamic. And of course there's always more optimization to be done, including making the upload of gradient ramps more incremental, and probably hashing of the stops instead of the processed ramps.	2021-08-09 16:16:46 -07:00
Raph Levien	074fafad1e	Turn off reuse of command buffers for now It worked ok on Vulkan but is causing problems on DX12 and Metal. Punt for now and come back to this later when we do more sophisticated resource management.	2021-05-31 21:09:15 -07:00
Raph Levien	bae185efbd	API reorg Move types into the toplevel and hide implementation details. Remove deref of hub CmdBuf to mux. Restrict public visibility of internals. Most items have some docs, though improvements are still possible. In particular, there should be detailed safety info.	2021-05-29 21:11:02 -07:00
Raph Levien	7d7c86c44b	API changes and cleanup Add workgroup size to dispatch call (needed by metal). Change all fence references to mutable for consistency. Move backend traits to a separate file (move them out of the toplevel namespace in preparation for the hub types going there, to make the public API nicer). Add a method and macro for automatically choosing shader code, and change collatz example to generate all 3 kinds on build.	2021-05-28 16:14:39 -07:00
Raph Levien	b6292c644f	Make fences mutable Change the interface for fences to accept mutable references. This will actualy help the Metal backend more than dx12 (avoiding interior mutability) but more accurately captures intent and matches gfx-hal.	2021-05-27 15:53:12 -07:00
Raph Levien	2ecfc7a414	Wire hub to mux Make the hub abstraction connect to the mux, rather than directly to the Vulkan back-end. As of this commit, both command line and winit examples work (on Vulkan). In theory it should be possible to get them working on Dx12 as well by translating the shader code, but there's a lot that can go wrong. This commit also contains a bunch of changes to mux to make conditional compilation of match arms work, and new methods to support swapchain.	2021-05-26 09:30:07 -07:00
Raph Levien	f04da3af9d	Add multiplexer abstraction Adds a new "mux" module which can have multiple backends. As of this commit, it's not wired up at all, but the functionality should be reasonably complete. Minor tweaks to the backend trait to accommodate this, mostly changing Fence and Semaphore to references so they don't need to be Copy. Part of the work toward #95	2021-05-25 15:12:37 -07:00
Raph Levien	174c81ec09	Cleanup Fix bound on blanket RetainResource impl. Clean up run_cmd_buf.	2021-05-24 15:42:25 -07:00
Raph Levien	22935fccc6	Use const generics for IntoRefs Yay! Now we can use an array of any size, slice, or vector.	2021-05-24 14:25:13 -07:00
Raph Levien	0cc72d9765	Reduce allocations for retaining resources Use an enum instead of Box<dyn Any> for resources to be retained until command buffer completion, and allow both references (which will be cloned) and owned resources (useful for staging buffers).	2021-05-24 14:10:28 -07:00
Raph Levien	47d2e0a756	Add create_buffer_init method Add a method to create a buffer with initial content, which requires staging buffers under the hood. This patch also changes the lower-level (Vulkan) interface to be closer to the raw Vulkan call.	2021-05-24 13:18:11 -07:00
Raph Levien	e9a8b4643b	Migrate to BufferUsage Adopt the BufferUsage concept from WebGPU, and replace MemFlags, which is inadequate.	2021-05-21 19:43:55 -07:00
Raph Levien	a5991ecf97	Expand runtime query of GPU capabilities Test whether the GPU supports subgroups (including size control) and memory model. This patch does all the ceremony needed for runtime query, including testing the Vulkan version and only probing the extensions when available. Thus, it should work fine on older devices (not yet tested). The reporting of capabilities follows Vulkan concepts, but is not particularly Vulkan-specific.	2021-05-08 11:41:47 -07:00
Raph Levien	115cb855d9	Query extensions at runtime Don't run extensions unless they're available. This includes querying for descriptor indexing, and running one of two versions of kernel4 depending on whether it's enabled. Part of the support needed for #78	2021-04-08 15:11:15 -07:00
Elias Naur	903ab1fb59	implement FillImage command and sRGB support FillImage is like Fill, except that it takes its color from one or more image atlases. kernel4 uses a single image for non-Vulkan hosts, and the dynamic sized array of image descriptors on Vulkan. A previous version of this commit used textures. I think images are a better choice for piet-gpu, for several reasons: - Texture sampling, in particular textureGrad, is slow on lower spec devices such as Google Pixel. Texture sampling is particularly slow and difficult to implement for CPU fallbacks. - Texture sampling need more parameters, in particular the full u,v transformation matrix, leading to a large increase in the command size. Since all commands use the same size, that memory penalty is paid by all scenes, not just scenes with textures. - It is unlikely that piet-gpu will support every kind of fill for every client, because each kind must be added to kernel4. With FillImage, a client will prepare the image(s) in separate shader stages, sampling and applying transformations and special effects as needed. Textures that align with the output pixel grid can be used directly, without pre-processing. Note that the pre-processing step can run concurrently with the piet-gpu pipeline; Only the last stage, kernel4, needs the images. Pre-processing most likely uses fixed function vertex/fragment programs, which on some GPUs may run in parallel with piet-gpu's compute programs. While here, fix a few validation errors: - Explicitly enable EXT_descriptor_indexing, KHR_maintenance3, KHR_get_physical_device_properties2. - Specify a vkDescriptorSetVariableDescriptorCountAllocateInfo for vkAllocateDescriptorSets. Otherwise, variable image2D arrays won't work (but sampler2D arrays do, at least on my setup). Updates #38 Signed-off-by: Elias Naur <mail@eliasnaur.com>	2021-03-19 12:50:12 +01:00
Raph Levien	facc9e0982	Use sampler for texture images Provide images to fine rasterization kernel as readonly textures with a sampler, rather than storage images. That lets us use the GPU's hardware for sampling, which should be considerably more efficient. There are a bunch of parameters that are hardcoded, but it does seem to work.	2020-11-25 18:05:10 -08:00
Raph Levien	047a0830d1	Towards wiring up images to k4 This patch passes a dynamically sized array of textures to the fine rasterizer. A bunch of the low level Vulkan stuff is done, but only enough of the shaders and encoders to do minimal testing. We'll want to switch from storage images to sampled images, track the actual array of textures during encoding, use that to build the descriptor set (which will need to be more dynamic), and of course run image elements through the pipeline. Progress towards #38	2020-11-24 22:11:38 -08:00
Raph Levien	6b06d249ab	Builder pattern for pipelines Use a builder pattern for pipelines and descriptor sets, so we can go richer without hugely complicating existing code. WIP	2020-11-24 22:11:38 -08:00
Raph Levien	47e24ec9d5	Start adding support for creating images This is still WIP, focused on creating image resources and making them available GPU-side. Progress toward #38	2020-11-19 16:32:29 -08:00
Raph Levien	75c4b62730	Add hub abstraction The hub does a little better lifetime tracking of resources (so Rust-side references can be dropped), and in the future will be used for dynamic selection of backend. The migration is still a bit half-baked, as there are a bunch of Vulkan-specific types in the signatures, but it shouldn't be too much work to sort that out. Perhaps it can wait until there is a second backend though. The main motivation for this is to create image objects with lifetime tracking, one of the things required for #38.	2020-11-18 16:06:08 -08:00

44 commits