o Iñh ã@s¨UddlZddlZddlmZmZddlmZddlZddlm Z m Z Gdd„dƒZdeded efd d„Z dd „Zeaeed<ejdd„ƒZGdd„dƒZddd„ZdS)éN)ÚCallableÚOptional)Ú deprecated)ÚKernelÚRegistrationHandlec@s4eZdZdZdefdd„Zdededefdd „Zd S)ÚFakeImplHolderz0A holder where one can register an fake impl to.ÚqualnamecCs||_d|_d|_dS©N)rÚkernelÚlib)Úselfr©r úL/var/www/vscode/kcb/lib/python3.10/site-packages/torch/_library/fake_impl.pyÚ__init__s zFakeImplHolder.__init__ÚfuncÚsourceÚreturncsÒˆjdurtdˆj›dˆjj›dƒ‚tj ˆjd¡r$tdˆj›dƒ‚tj ˆjd¡r5tdˆj›dƒ‚t||ƒˆ_ˆjdurPˆj d ¡d }tj |d¡ˆ_tˆjˆƒ}ˆj ˆj|d¡‡fdd „}t|ƒS)z}Register an fake impl. Returns a RegistrationHandle that one can use to de-register this fake impl. Nz!register_fake(...): the operator z( already has an fake impl registered at Ú.ÚMetaz´ already has an DispatchKey::Meta implementation via a pre-existing torch.library or TORCH_LIBRARY registration. Please either remove that registration or don't call register_fake.ÚCompositeImplicitAutograda% already has an implementation for this device type via a pre-existing registration to DispatchKey::CompositeImplicitAutograd.CompositeImplicitAutograd operators do not need an fake impl; instead, the operator will decompose into its constituents and those can have fake impls defined on them.z::rÚFRAGMENTcs ˆjrˆj ¡dˆ_dˆ_dSr )rÚ_destroyr r ©rr rÚderegister_fake_classAs z6FakeImplHolder.register..deregister_fake_class)r ÚRuntimeErrorrrÚtorchÚ_CÚ%_dispatch_has_kernel_for_dispatch_keyrrÚsplitÚlibraryÚLibraryÚconstruct_meta_kernelÚimplr)rrrÚnsÚmeta_kernelrr rrÚregisters0 þÿÿÿÿ zFakeImplHolder.registerN) Ú__name__Ú __module__Ú__qualname__Ú__doc__Ústrrrrr%r r r rrsrrÚfake_impl_holderrcs.ˆjdusJ‚t ˆjj¡‡‡fdd„ƒ}|S)Ncs`ˆjdusJ‚ˆjj‰‡‡fdd„}t|ƒˆj|i|¤ŽWdƒS1s)wYdS)Ncstˆ›dˆ›dƒ‚)Nz (a¿): You're trying to run this operator with meta Tensors (as opposed to FakeTensors), but this operator may return an output Tensor with data-dependent shape. Meta Tensors don't support operators with outputs that have data-dependent shapes but FakeTensors do. If your operator does not return an output with data-dependent shape, make sure the FakeTensor and/or meta kernel does not call torch.library.get_ctx(). Otherwise, please use FakeTensors.)rr )rrr rÚerror_on_ctxRsÿz@construct_meta_kernel..meta_kernel..error_on_ctx)r rÚset_ctx_getter)ÚargsÚkwargsr,©r+r)rrr$Ms $ÿz*construct_meta_kernel..meta_kernel)r Ú functoolsÚwrapsr)rr+r$r r0rr!Jsr!cCsdSr r r r r rÚget_nonedsr3Úglobal_ctx_getterccs"t}z |adVW|adS|awr )r4)Ú ctx_getterÚprevr r rr-ks€r-c@sTeZdZdZdd„Zededdddœd ejfd d„ƒZ dddœd ejfd d„Z dS)ÚFakeImplCtxzO Context object for writing fake implementations for custom operators. cCs||_|j|_||_dSr )Ú _fake_modeÚ shape_envÚ _shape_envÚ_op)rr8r;r r rr{s zFakeImplCtx.__init__zM`create_unbacked_symint` is deprecated, please use `new_dynamic_size` instead)ÚcategoryéN©ÚminÚmaxrcCs|j||dS©Nr>)Únew_dynamic_size©rr?r@r r rÚcreate_unbacked_symint€sz"FakeImplCtx.create_unbacked_symintrcCsv|jdus |jjstjj |j¡‚t|tjƒst|tjƒr(t d|›d|›dƒ‚|dkr4t d|›dƒ‚t |j||ƒS)a Constructs a new symint (symbolic int) representing a data-dependent value. This is useful for writing the fake implementation (which is necessary for torch.compile) for a CustomOp where an output Tensor has a size that depends on the data of the input Tensors. Args: min (int): A statically known inclusive lower bound for this symint. Default: 0 max (Optional[int]): A statically known inclusive upper bound for this symint. Default: None .. warning: It is important that the ``min`` and ``max`` (if not None) values are set correctly, otherwise, there will be undefined behavior under torch.compile. The default value of ``min`` is 2 due to torch.compile specializing on 0/1 sizes. You must also verify that your implementation on concrete Tensors (e.g. CPU/CUDA) only returns Tensors where the size that corresponds to the symint also has respects these constraint. The easiest way to do this is to add an assertion in the CPU/CUDA/etc implementation that the size follows these bounds. Example:: >>> # An operator with data-dependent output shape >>> lib = torch.library.Library("mymodule", "FRAGMENT") >>> lib.define("mymodule::custom_nonzero(Tensor x) -> Tensor") >>> >>> @torch.library.register_fake("mymodule::custom_nonzero") >>> def _(x): >>> # Number of nonzero-elements is data-dependent. >>> # Since we cannot peek at the data in an fake impl, >>> # we use the ctx object to construct a new symint that >>> # represents the data-dependent size. >>> ctx = torch.library.get_ctx() >>> nnz = ctx.new_dynamic_size() >>> shape = [nnz, x.dim()] >>> result = x.new_empty(shape, dtype=torch.int64) >>> return result >>> >>> @torch.library.impl(lib, "custom_nonzero", "CPU") >>> def _(x): >>> x_np = x.numpy() >>> res = np.stack(np.nonzero(x_np), axis=1) >>> return torch.tensor(res, device=x.device) Nzctx.new_dynamic_size(min=z, max=zZ): expected min and max to be statically known ints but got SymInt. This is not supported.rzc, ...): expected min to be greater than or equal to 0: this API can only create non-negative sizes.)r:Úallow_dynamic_output_shape_opsrÚ_subclassesÚfake_tensorÚDynamicOutputShapeExceptionr;Ú isinstanceÚSymIntÚ ValueErrorÚ allocate_sizerCr r rrB‡s 3ÿÿ ÿzFakeImplCtx.new_dynamic_size)r&r'r(r)rrÚ FutureWarningrrJrDrBr r r rr7vsþr7cCs"| ¡}tjjjj|||d|SrA)rDrÚfxÚexperimentalÚsymbolic_shapesÚ_constrain_range_for_size)r9Úmin_valÚmax_valÚresultr r rrLÐs ÿrL)rN)Ú contextlibr1ÚtypingrrÚtyping_extensionsrrÚtorch._library.utilsrrrr*r!r3r4Ú__annotations__Úcontextmanagerr-r7rLr r r rÚs ? Z