Broken Colab Demo

Hello, I wanted to test metavoice using official Colab Demo but I am getting this error during inference:

`using dtype=float16
Fetching&#8199;6&#8199;files:&#8199;100%
&#8199;6/6&#8199;[00:00<00:00,&#8199;293.22it/s]
number of parameters: 14.07M
2024-07-13 18:39:34 | INFO     | DF | Loading model settings of DeepFilterNet3
2024-07-13 18:39:34 | INFO     | DF | Using DeepFilterNet3 model at /root/.cache/DeepFilterNet/DeepFilterNet3
2024-07-13 18:39:34 | INFO     | DF | Initializing model `deepfilternet3`
2024-07-13 18:39:34 | INFO     | DF | Found checkpoint /root/.cache/DeepFilterNet/DeepFilterNet3/checkpoints/model_120.ckpt.best with epoch 120
2024-07-13 18:39:34 | INFO     | DF | Running on device cuda:0
2024-07-13 18:39:34 | INFO     | DF | Model loaded
Using device=cuda
Loading model ...
using dtype=float16
Time to load model: 12.20 seconds
Compiling...Can take up to 2 mins.
---------------------------------------------------------------------------
TorchRuntimeError                         Traceback (most recent call last)
[<ipython-input-7-4fd93583efa0>](https://localhost:8080/#) in <cell line: 7>()
      5 torch._dynamo.config.suppress_errors = True
      6 
----> 7 tts = TTS()

89 frames
[/usr/local/lib/python3.10/dist-packages/torch/_dynamo/utils.py](https://localhost:8080/#) in run_node(tracer, node, args, kwargs, nnmodule)
   1762         try:
   1763             if op == "call_function":
-> 1764                 return node.target(*args, **kwargs)
   1765             elif op == "call_method":
   1766                 return getattr(args[0], node.target)(*args[1:], **kwargs)

TorchRuntimeError: Failed running call_function <built-in function scaled_dot_product_attention>(*(FakeTensor(..., device='cuda:0', size=(2, 16, s0, 128)), FakeTensor(..., device='cuda:0', size=(2, 16, 2048, 128), dtype=torch.float16), FakeTensor(..., device='cuda:0', size=(2, 16, 2048, 128), dtype=torch.float16)), **{'attn_mask': FakeTensor(..., device='cuda:0', size=(1, 1, s0, 2048), dtype=torch.bool), 'dropout_p': 0.0}):
Expected query, key, and value to have the same dtype, but got query.dtype: float key.dtype: c10::Half and value.dtype: c10::Half instead.

from user code:
   File "/content/metavoice-src/metavoice-src/fam/llm/fast_inference_utils.py", line 131, in prefill
    logits = model(x, spk_emb, input_pos)
  File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/module.py", line 1541, in _call_impl
    return forward_call(*args, **kwargs)
  File "/content/metavoice-src/metavoice-src/fam/llm/fast_model.py", line 160, in forward
    x = layer(x, input_pos, mask)
  File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/module.py", line 1541, in _call_impl
    return forward_call(*args, **kwargs)
  File "/content/metavoice-src/metavoice-src/fam/llm/fast_model.py", line 179, in forward
    h = x + self.attention(self.attention_norm(x), mask, input_pos)
  File "/usr/local/lib/python3.10/dist-packages/torch/nn/modules/module.py", line 1541, in _call_impl
    return forward_call(*args, **kwargs)
  File "/content/metavoice-src/metavoice-src/fam/llm/fast_model.py", line 222, in forward
    y = F.scaled_dot_product_attention(q, k, v, attn_mask=mask, dropout_p=0.0)

Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information`

Moreover I am also getting this error when I tried to run model locally using instruction provided by README.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Broken Colab Demo #187

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Broken Colab Demo #187

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions