Running phi-2 on MacOS with returns garbage

### 🐛 Describe the bug

Almost verbatim HF example
```python
import torch
from torch.profiler import profile, ProfilerActivity
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("microsoft/phi-2", torch_dtype="auto", trust_remote_code=True)
print(next(model.parameters()).dtype)
tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2", trust_remote_code=True)

inputs = tokenizer('''def print_prime(n):
   """
   Print all primes between 1 and n
   """''', return_tensors="pt", return_attention_mask=False)

outputs = model.generate(**inputs, max_length=32)
text = tokenizer.batch_decode(outputs)[0]
print(text)
```

Changing data type to bf16 fixes the problem. This is tru for both CPU and MPS

### Versions

CI

cc @kulinseth @DenisVieriu97 @jhavukainen @albanD @snadampal @milpuz01 @aditew01 @nikhil-arm @fadara01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Running phi-2 on MacOS with returns garbage #160841

🐛 Describe the bug

Versions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Running phi-2 on MacOS with returns garbage #160841

Description

🐛 Describe the bug

Versions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions