Error
W1204 00:25:55.768000 4063995 site-packages/torch/_export/__init__.py:64] +============================+
W1204 00:25:55.769000 4063995 site-packages/torch/_export/__init__.py:65] |     !!!   WARNING   !!!    |
W1204 00:25:55.770000 4063995 site-packages/torch/_export/__init__.py:66] +============================+
W1204 00:25:55.770000 4063995 site-packages/torch/_export/__init__.py:67] capture_pre_autograd_graph() is deprecated and doesn't provide any function guarantee moving forward.
W1204 00:25:55.771000 4063995 site-packages/torch/_export/__init__.py:68] Please switch to use torch.export.export_for_training instead.
class GraphModule(torch.nn.Module):
    def forward(self, input_ids):
        arg0: "i64[2, 9]";
        arg0, = fx_pytree.tree_flatten_spec(([input_ids], {}), self._in_spec)
        arg0_1 = arg0
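
For context, a minimal sketch of the kind of call that emits this warning, assuming a PyTorch build where the deprecated torch._export.capture_pre_autograd_graph entry point still exists (the toy model here is a placeholder, not the original code):

import torch
import torch.nn as nn
# Deprecated entry point; removed in newer PyTorch releases.
from torch._export import capture_pre_autograd_graph

model = nn.Linear(4, 4).eval()
example_inputs = (torch.randn(2, 4),)

# Emits the deprecation warning shown above; printing the result gives a
# GraphModule dump similar to the truncated one in the error log.
gm = capture_pre_autograd_graph(model, example_inputs)
print(gm)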
Solution
The solution is to look into the implementation of torch.export.export_for_training and update the workflow so that tied weights are handled correctly. This could involve adding code that accounts for the tied weights in the model passed through prepare_pt2e, such as properly propagating updates to the tied parameters during the training export. Alternatively, if feasible, a custom wrapper around export_for_training could be written that caters specifically to models with tied weights; a migration sketch follows below.
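
A minimal migration sketch, assuming the goal is PT2E quantization of a model whose input embedding and output projection share one weight tensor. TinyLM and its shapes are hypothetical stand-ins; export_for_training (PyTorch 2.5+), prepare_pt2e, and XNNPACKQuantizer are public PyTorch APIs, but whether the tied weights survive export without being duplicated should be verified on the actual model:

import torch
import torch.nn as nn
from torch.ao.quantization.quantize_pt2e import prepare_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)

class TinyLM(nn.Module):
    """Hypothetical toy LM whose embedding and output head are tied."""
    def __init__(self, vocab=32, dim=16):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.lm_head = nn.Linear(dim, vocab, bias=False)
        self.lm_head.weight = self.embed.weight  # weight tying

    def forward(self, input_ids):
        return self.lm_head(self.embed(input_ids))

model = TinyLM().eval()
example_inputs = (torch.randint(0, 32, (2, 9)),)  # matches the i64[2, 9] input above

# Replacement for the deprecated capture_pre_autograd_graph():
exported = torch.export.export_for_training(model, example_inputs)
graph_module = exported.module()

# Standard PT2E preparation; inserts observers for calibration.
quantizer = XNNPACKQuantizer().set_global(get_symmetric_quantization_config())
prepared = prepare_pt2e(graph_module, quantizer)

# Sanity check: before export, both references point at the same tensor.
# After export, the graph may carry duplicated copies of that tensor;
# fixing that is the custom tied-weight handling described above.
assert model.lm_head.weight is model.embed.weight
prepared(*example_inputs)  # calibration pass

If the exported program does duplicate the shared tensor, one option is to re-tie the corresponding entries in its state dict before converting, which is the kind of custom handling the paragraph above suggests.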