Also updates comment.
We know the implementation is incomplete and needs refinement, but it
seems useful to commit as a starting point for further work.
This patch includes a number of changes to encourage reading and writing
buffers through mapping rather than copying data as before.
This includes a new `BufWrite` abstraction which is designed for filling
buffers. It behaves much like a Vec<u8>, but with fixed capacity.
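As a rough illustration of the intended shape (not the exact piet-gpu-hal API; names and methods here are illustrative), a fixed-capacity writer over an already-mapped region might look like:

```rust
// Illustrative sketch only; the real BufWrite may differ in details.
// A Vec<u8>-like writer over a mapped region with fixed capacity: it never
// reallocates, so writes land directly in the mapped buffer.
pub struct BufWrite {
    ptr: *mut u8,
    len: usize,
    capacity: usize,
}

impl BufWrite {
    /// Safety: `ptr` must point to `capacity` writable bytes that stay
    /// mapped for the lifetime of this value.
    pub unsafe fn new(ptr: *mut u8, capacity: usize) -> BufWrite {
        BufWrite { ptr, len: 0, capacity }
    }

    /// Append bytes; returns false (rather than reallocating) if the
    /// fixed capacity would be exceeded.
    pub fn extend_from_slice(&mut self, bytes: &[u8]) -> bool {
        if self.len + bytes.len() > self.capacity {
            return false;
        }
        unsafe {
            std::ptr::copy_nonoverlapping(bytes.as_ptr(), self.ptr.add(self.len), bytes.len());
        }
        self.len += bytes.len();
        true
    }

    pub fn len(&self) -> usize {
        self.len
    }
}
```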
This gets it working on macOS, and deletes the old implementation.
There's also an update to winit 0.25 in here, because it was easier to
roll forward than fix inconsistent Cargo.lock. At some point, we should
systematically update all deps.
Use an array of bind types rather than the previous interface, which was
a choice between plain buffer counts or a heavier builder pattern.
The main thing this unlocks is distinguishing between readonly and
read/write buffers, which is important for DX12.
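Roughly, the new interface is a slice of bind types, one per binding slot. The sketch below is illustrative; the actual enum and variant names may differ.

```rust
/// Sketch of the bind-type array idea; variant names are illustrative.
#[derive(Clone, Copy, PartialEq, Eq)]
pub enum BindType {
    /// Read/write storage buffer (a UAV on DX12).
    Buffer,
    /// Read-only storage buffer (an SRV on DX12).
    BufReadOnly,
    /// Storage image.
    Image,
}

// A pipeline's bindings are then just a slice, e.g. one read-only input
// buffer followed by one read/write output buffer:
// &[BindType::BufReadOnly, BindType::Buffer]
```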
This is WIP: the Metal part hasn't been done, and the old code hasn't
been deleted yet.
Part of #125
This was motivated by experiments with the Vulkan memory model. To use
that, we actually need to explicitly enable the relevant feature at
device creation time. That's a lot easier to do now that push_next works
on the structs in that chain. This PR doesn't do that though, it only
upgrades the dependency and cleans up deprecations.
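For context, enabling that feature with the upgraded ash would look roughly like the sketch below. It assumes `instance`, `physical_device`, and `queue_family_index` are already in scope, and the feature struct may carry a KHR suffix depending on ash version; this PR does not actually do this.

```rust
use ash::vk;

// Sketch only: enabling the Vulkan memory model feature at device creation.
let queue_priorities = [1.0f32];
let queue_info = vk::DeviceQueueCreateInfo::builder()
    .queue_family_index(queue_family_index)
    .queue_priorities(&queue_priorities)
    .build();
let mut memory_model = vk::PhysicalDeviceVulkanMemoryModelFeatures::builder()
    .vulkan_memory_model(true);
let create_info = vk::DeviceCreateInfo::builder()
    .queue_create_infos(std::slice::from_ref(&queue_info))
    // push_next chains the feature struct into pNext of device creation.
    .push_next(&mut memory_model);
let device = unsafe { instance.create_device(physical_device, &create_info, None)? };
```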
Reuse submitted command buffers rather than continually allocating them.
This patch also improves the story across the different backends. On
DX12 it was reusing allocators without resetting them, which could be a
leak. And on Metal the reset "fails," so there's always a new alloc.
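The recycling scheme can be pictured roughly like the sketch below; the types are purely illustrative stand-ins for the backend-specific command buffer and fence objects.

```rust
/// Sketch of command-buffer recycling; `CmdBuf` and `Fence` stand in for
/// the backend-specific types.
struct CmdBufPool<CmdBuf, Fence> {
    free: Vec<CmdBuf>,
    /// Submitted command buffers paired with the fence that signals when
    /// the GPU is done with them.
    pending: Vec<(CmdBuf, Fence)>,
}

impl<CmdBuf, Fence> CmdBufPool<CmdBuf, Fence> {
    /// Reuse a completed command buffer if one is available; the caller
    /// only allocates (and resets allocators) when this returns None.
    fn acquire(&mut self, is_signaled: impl Fn(&Fence) -> bool) -> Option<CmdBuf> {
        // Move completed submissions back to the free list.
        let mut i = 0;
        while i < self.pending.len() {
            if is_signaled(&self.pending[i].1) {
                let (cmd_buf, _fence) = self.pending.swap_remove(i);
                self.free.push(cmd_buf);
            } else {
                i += 1;
            }
        }
        self.free.pop()
    }
}
```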
This patch gets rid of warnings and runs cargo fmt.
A lot of the warnings were unused items (especially in DX12 land). At
some point we might want to bring some of that back, at which point it
might be useful to refer to what was deleted in this commit.
Pipeline the CPU and GPU work so that two frames can be in flight at
once.
This dramatically improves performance, especially on Android. Note
that I've also changed the default configuration to be 3 frames in
flight and FIFO mode.
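The control flow is roughly the following sketch; the function names are placeholders for the example's real encode/submit/present calls.

```rust
// Sketch of N frames in flight: only wait on the fence guarding the slot
// we are about to reuse, so the CPU can encode the next frame while the
// GPU is still working on previous ones.
const FRAMES_IN_FLIGHT: usize = 3; // new default, together with FIFO present mode

let mut frame: usize = 0;
loop {
    let slot = frame % FRAMES_IN_FLIGHT;
    // Blocks only if the GPU is more than FRAMES_IN_FLIGHT - 1 frames behind.
    wait_and_reset_fence(&mut fences[slot]);
    // This slot's command buffer and per-frame resources are now free to reuse.
    record_frame(&mut cmd_bufs[slot], frame);
    submit(&cmd_bufs[slot], &fences[slot]);
    present(slot);
    frame += 1;
}
```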
This patch deallocates command buffers after command submission completes
(the same time as other resources are released).
It should be portable and robust across all back-ends, though not
necessarily the most efficient; reuse of command buffers, as well as more
efficient allocation on Vulkan and DX12, is left for follow-up work.
WIP. Most of the GPU-side work should be done (though it's not tested
end-to-end and it's certainly possible I missed something), but still
needs work on the encoding side.
Move types into the toplevel and hide implementation details. Remove
deref of hub CmdBuf to mux. Restrict public visibility of internals.
Most items have some docs, though improvements are still possible. In
particular, there should be detailed safety info.
Add workgroup size to dispatch call (needed by Metal). Change all fence
references to mutable for consistency.
Move backend traits to a separate file (move them out of the toplevel
namespace in preparation for the hub types going there, to make the
public API nicer).
Add a method and macro for automatically choosing shader code, and
change collatz example to generate all 3 kinds on build.
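The idea is roughly the following sketch (the actual macro and method names in the crate may differ): the build step emits SPIR-V, HLSL, and MSL variants, and the macro hands all of them to the device, which picks the one matching the backend chosen at runtime.

```rust
// Illustrative sketch of the shader-selection macro.
macro_rules! include_shader {
    ($device:expr, $path_base:expr) => {
        $device.choose_shader(
            include_bytes!(concat!($path_base, ".spv")),
            include_str!(concat!($path_base, ".hlsl")),
            include_str!(concat!($path_base, ".msl")),
        )
    };
}

// Usage in the collatz example, where the build generates all three kinds:
// let code = include_shader!(&device, "./shader/gen/collatz");
```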
Change the interface for fences to accept mutable references. This will
actually help the Metal backend more than DX12 (avoiding interior
mutability), but it more accurately captures intent and matches gfx-hal.
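In trait terms, the change is approximately the following sketch (signatures are illustrative, not the exact piet-gpu-hal definitions):

```rust
/// Sketch of the signature change: fences are passed by mutable reference.
pub trait Device {
    type Fence;
    type Error;

    /// &mut documents that waiting may reset fence state, and lets the
    /// Metal backend avoid interior mutability.
    unsafe fn wait_and_reset(&self, fences: Vec<&mut Self::Fence>) -> Result<(), Self::Error>;
    unsafe fn get_fence_status(&self, fence: &mut Self::Fence) -> Result<bool, Self::Error>;
}
```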
Make the hub abstraction connect to the mux, rather than directly to the
Vulkan back-end.
As of this commit, both command line and winit examples work (on
Vulkan). In theory it should be possible to get them working on DX12 as
well by translating the shader code, but there's a lot that can go
wrong.
This commit also contains a bunch of changes to mux to make conditional
compilation of match arms work, and new methods to support swapchain.
Adds a new "mux" module which can have multiple backends. As of this
commit, it's not wired up at all, but the functionality should be
reasonably complete.
Minor tweaks to the backend trait to accommodate this, mostly changing
Fence and Semaphore to references so they don't need to be Copy.
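Concretely, the mux is a set of enums with one variant per backend and cfg-gated match arms, so a binary only pays for the backends compiled in. The sketch below is illustrative; the module and type names, the error type, and the cfg conditions are placeholders.

```rust
// Illustrative sketch of the mux pattern.
pub enum Device {
    #[cfg(not(target_os = "macos"))]
    Vk(vulkan::VkDevice),
    #[cfg(target_os = "windows")]
    Dx12(dx12::Dx12Device),
    #[cfg(target_os = "macos")]
    Mtl(metal::MtlDevice),
}

impl Device {
    pub unsafe fn create_fence(&self, signaled: bool) -> Result<Fence, Error> {
        // Each method dispatches to the active backend; arms for backends
        // that aren't compiled in are cfg'd out along with their variants.
        match self {
            #[cfg(not(target_os = "macos"))]
            Device::Vk(d) => Ok(Fence::Vk(d.create_fence(signaled)?)),
            #[cfg(target_os = "windows")]
            Device::Dx12(d) => Ok(Fence::Dx12(d.create_fence(signaled)?)),
            #[cfg(target_os = "macos")]
            Device::Mtl(d) => Ok(Fence::Mtl(d.create_fence(signaled)?)),
        }
    }
}
```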
Part of the work toward #95
Add a method to create a buffer with initial content, which requires
staging buffers under the hood.
This patch also changes the lower-level (Vulkan) interface to be closer
to the raw Vulkan call.
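The flow under the hood is roughly the following sketch; the method names are illustrative, not the exact API.

```rust
// Sketch of creating a device-local buffer with initial contents.
unsafe fn create_buffer_init(device: &Device, contents: &[u8]) -> Result<Buffer, Error> {
    // Host-visible staging buffer plus the device-local destination.
    let staging = device.create_buffer(contents.len() as u64, MemFlags::host_coherent())?;
    let buffer = device.create_buffer(contents.len() as u64, MemFlags::device_local())?;
    // Fill the staging buffer through a mapping.
    device.write_buffer(&staging, contents)?;
    // Record a copy and submit it; the staging buffer can be released once
    // the submission completes.
    let mut cmd_buf = device.create_cmd_buf()?;
    cmd_buf.begin();
    cmd_buf.copy_buffer(&staging, &buffer);
    cmd_buf.finish();
    device.run_cmd_buf(&cmd_buf)?;
    Ok(buffer)
}
```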
Create compute pipelines from shader source and descriptor sets. This
gets it to the point where it can run the collatz example.
Still WIP and with rough edges, of course.
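Intended usage, roughly as in the collatz example (a sketch; exact method names and signatures may differ):

```rust
// Sketch of the end-to-end flow for a simple compute dispatch.
let code = include_bytes!("./shader/collatz.spv");
// One storage buffer binding; the pipeline is described by its shader plus
// the resources it binds.
let pipeline = device.create_simple_compute_pipeline(code, 1)?;
let descriptor_set = device.create_descriptor_set(&pipeline, &[&data_buffer])?;
let mut cmd_buf = device.create_cmd_buf()?;
cmd_buf.begin();
cmd_buf.dispatch(&pipeline, &descriptor_set, (256, 1, 1));
cmd_buf.finish();
device.run_cmd_buf(&cmd_buf)?;
```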
Test whether the GPU supports subgroups (including size control) and
memory model.
This patch does all the ceremony needed for runtime query, including
testing the Vulkan version and only probing the extensions when
available. Thus, it should work fine on older devices (not yet tested).
The reporting of capabilities follows Vulkan concepts, but is not
particularly Vulkan-specific.
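The query itself, with ash, looks roughly like the sketch below. It assumes `instance` and `physical_device` are in scope and that a Vulkan 1.1+ instance was detected; subgroup size control would be probed the same way through its EXT property struct.

```rust
use ash::vk;

// Query subgroup support via the properties2 chain.
let mut subgroup_props = vk::PhysicalDeviceSubgroupProperties::builder();
let mut props2 = vk::PhysicalDeviceProperties2::builder().push_next(&mut subgroup_props);
unsafe { instance.get_physical_device_properties2(physical_device, &mut props2) };

// Query memory model support via the features2 chain.
let mut memory_model = vk::PhysicalDeviceVulkanMemoryModelFeatures::builder();
let mut features2 = vk::PhysicalDeviceFeatures2::builder().push_next(&mut memory_model);
unsafe { instance.get_physical_device_features2(physical_device, &mut features2) };

let has_subgroup_basic = subgroup_props
    .supported_operations
    .contains(vk::SubgroupFeatureFlags::BASIC);
let has_memory_model = memory_model.vulkan_memory_model != vk::FALSE;
```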
Don't run extensions unless they're available. This includes querying
for descriptor indexing, and running one of two versions of kernel4
depending on whether it's enabled.
Part of the support needed for #78
capabilities.min_image_count is 4 on my system, which is larger than
the hard-coded 2.
Use a default swapchain size if we're not getting any size information
from the surface capabilities.
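Roughly, the selection ends up as the sketch below (over ash's surface capabilities; the default extent value is just a placeholder):

```rust
use ash::vk;

// `caps` is the vk::SurfaceCapabilitiesKHR queried from the surface.
// Respect min_image_count instead of hard-coding 2; max_image_count == 0
// means "no upper limit".
let image_count = if caps.max_image_count > 0 {
    caps.min_image_count.min(caps.max_image_count)
} else {
    caps.min_image_count
};
// A current_extent width of u32::MAX means the surface reports no size;
// fall back to a default in that case.
let extent = if caps.current_extent.width == u32::MAX {
    vk::Extent2D { width: 1024, height: 768 } // placeholder default
} else {
    caps.current_extent
};
```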
Signed-off-by: Elias Naur <mail@eliasnaur.com>
FillImage is like Fill, except that it takes its color from one or
more image atlases.
kernel4 uses a single image for non-Vulkan hosts, and a dynamically sized
array of image descriptors on Vulkan.
A previous version of this commit used textures. I think images are a better
choice for piet-gpu, for several reasons:
- Texture sampling, in particular textureGrad, is slow on lower spec devices
such as Google Pixel. It is also particularly slow and difficult to
implement for CPU fallbacks.
- Texture sampling needs more parameters, in particular the full u,v
transformation matrix, leading to a large increase in the command size. Since
all commands use the same size, that memory penalty is paid by all scenes, not
just scenes with textures.
- It is unlikely that piet-gpu will support every kind of fill for every
client, because each kind must be added to kernel4.
With FillImage, a client will prepare the image(s) in separate shader stages,
sampling and applying transformations and special effects as needed. Textures
that align with the output pixel grid can be used directly, without
pre-processing.
Note that the pre-processing step can run concurrently with the piet-gpu pipeline;
only the last stage, kernel4, needs the images.
Pre-processing most likely uses fixed function vertex/fragment programs,
which on some GPUs may run in parallel with piet-gpu's compute programs.
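For illustration only, the per-command data FillImage adds over Fill is small, on the order of an atlas index plus an offset. The field names and layout below are hypothetical, not the generated ptcl encoding:

```rust
// Hypothetical sketch of the new command's payload.
#[repr(C)]
struct CmdFillImage {
    /// Path segment data, shared with the plain Fill command.
    tile_ref: u32,
    backdrop: i32,
    /// Which image atlas to read from.
    index: u32,
    /// Offset of this fill within the atlas, so kernel4 can derive texel
    /// coordinates directly from the output pixel position.
    offset: [i16; 2],
}
```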
While here, fix a few validation errors:
- Explicitly enable EXT_descriptor_indexing, KHR_maintenance3,
KHR_get_physical_device_properties2.
- Specify a vkDescriptorSetVariableDescriptorCountAllocateInfo for
vkAllocateDescriptorSets. Otherwise, variable image2D arrays won't work (but
sampler2D arrays do, at least on my setup).
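The second fix, in ash terms, is roughly the following sketch. It assumes `device`, `descriptor_pool`, `layouts`, and the image count `n_images` are in scope; older ash versions name the struct with an EXT suffix.

```rust
use ash::vk;

// Chain the variable descriptor count info into the allocation.
let counts = [n_images];
let mut variable_counts = vk::DescriptorSetVariableDescriptorCountAllocateInfo::builder()
    .descriptor_counts(&counts);
let alloc_info = vk::DescriptorSetAllocateInfo::builder()
    .descriptor_pool(descriptor_pool)
    .set_layouts(&layouts)
    .push_next(&mut variable_counts);
let descriptor_sets = unsafe { device.allocate_descriptor_sets(&alloc_info)? };
```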
Updates #38
Signed-off-by: Elias Naur <mail@eliasnaur.com>
Provide images to fine rasterization kernel as readonly textures with a
sampler, rather than storage images. That lets us use the GPU's hardware
for sampling, which should be considerably more efficient.
There are a bunch of parameters that are hardcoded, but it does seem to
work.
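In descriptor terms the change is small; the sketch below is illustrative (the binding index is a placeholder):

```rust
use ash::vk;

// kernel4's image binding becomes a sampled texture + sampler instead of a
// storage image.
let binding = vk::DescriptorSetLayoutBinding::builder()
    .binding(3) // illustrative binding index
    .descriptor_type(vk::DescriptorType::COMBINED_IMAGE_SAMPLER) // was STORAGE_IMAGE
    .descriptor_count(1)
    .stage_flags(vk::ShaderStageFlags::COMPUTE);
// Sampled read-only images would typically sit in SHADER_READ_ONLY_OPTIMAL
// layout rather than GENERAL.
```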
This patch passes a dynamically sized array of textures to the fine
rasterizer.
A bunch of the low level Vulkan stuff is done, but only enough of the
shaders and encoders to do minimal testing. We'll want to switch from
storage images to sampled images, track the actual array of textures
during encoding, use that to build the descriptor set (which will need
to be more dynamic), and of course run image elements through the
pipeline.
Progress towards #38
We keep a small window of the clip stack in registers in the fine
rasterization kernel, and when that window is exceeded, spill to global
memory, so the clip stack can be unbounded.
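The mechanism, expressed as a Rust sketch rather than the actual shader code (the window size and element type are illustrative):

```rust
// Sketch of a windowed stack: the top WINDOW_SIZE entries live in a small
// ring buffer (registers in the shader); older entries spill to memory.
const WINDOW_SIZE: usize = 4;

struct ClipStack {
    window: [u32; WINDOW_SIZE],
    /// Total logical depth of the stack.
    depth: usize,
    /// Backing store for entries spilled when the window overflows.
    spill: Vec<u32>,
}

impl ClipStack {
    fn push(&mut self, value: u32) {
        if self.depth >= WINDOW_SIZE {
            // Window is full: spill the oldest in-window entry.
            self.spill.push(self.window[self.depth % WINDOW_SIZE]);
        }
        self.window[self.depth % WINDOW_SIZE] = value;
        self.depth += 1;
    }

    fn pop(&mut self) -> u32 {
        self.depth -= 1;
        let value = self.window[self.depth % WINDOW_SIZE];
        if self.depth >= WINDOW_SIZE {
            // Refill the vacated slot from the spill area.
            self.window[self.depth % WINDOW_SIZE] = self.spill.pop().unwrap();
        }
        value
    }
}
```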
The hub does a little better lifetime tracking of resources (so
Rust-side references can be dropped), and in the future will be used for
dynamic selection of backend.
The migration is still a bit half-baked, as there are a bunch of
Vulkan-specific types in the signatures, but it shouldn't be too much
work to sort that out. Perhaps it can wait until there is a second
backend though.
The main motivation for this is to create image objects with lifetime
tracking, one of the things required for #38.