Commit c8abce7

update whisper woq int8 accuracy to good (green sign) (#3153)
1 parent ad2a4df commit c8abce7

3 files changed: +3 -3 lines changed

README.md

Lines changed: 1 addition & 1 deletion
@@ -54,7 +54,7 @@ In the current technological landscape, Generative AI (GenAI) workloads and mode
 |Phi| microsoft/Phi-3-mini-128k-instruct | 🟩 | 🟩 | 🟨 | 🟩 | 🟨 |
 |Phi| microsoft/Phi-3-medium-4k-instruct | 🟩 | 🟩 | 🟨 | 🟩 | 🟨 |
 |Phi| microsoft/Phi-3-medium-128k-instruct | 🟩 | 🟩 | 🟨 | 🟩 | 🟨 |
-|Whisper| openai/whisper-large-v2 | 🟩 | 🟩 | 🟩 | | |
+|Whisper| openai/whisper-large-v2 | 🟩 | 🟩 | 🟩 | 🟩 | |
 
 - 🟩 signifies that the model can perform well and with good accuracy (<1% difference as compared with FP32).
 

docs/_static/htmls/tbl_single.html

Lines changed: 1 addition & 1 deletion
@@ -341,7 +341,7 @@
 <td><p style="text-align: center; vertical-align: middle;">🟩</p></td>
 <td><p style="text-align: center; vertical-align: middle;">🟩</p></td>
 <td><p style="text-align: center; vertical-align: middle;">🟩</p></td>
-<td><p style="text-align: center; vertical-align: middle;"></p></td>
+<td><p style="text-align: center; vertical-align: middle;">🟩</p></td>
 <td><p style="text-align: center; vertical-align: middle;"></p></td>
 </tr>
 </tbody>

examples/cpu/llm/inference/README.md

Lines changed: 1 addition & 1 deletion
@@ -40,7 +40,7 @@
 |Phi| microsoft/Phi-3-mini-128k-instruct | 🟩 | 🟩 | 🟨 | 🟩 | 🟨 |
 |Phi| microsoft/Phi-3-medium-4k-instruct | 🟩 | 🟩 | 🟨 | 🟩 | 🟨 |
 |Phi| microsoft/Phi-3-medium-128k-instruct | 🟩 | 🟩 | 🟨 | 🟩 | 🟨 |
-|Whisper| openai/whisper-large-v2 | 🟩 | 🟩 | 🟩 | | |
+|Whisper| openai/whisper-large-v2 | 🟩 | 🟩 | 🟩 | 🟩 | |
 
 ## 1.2 Verified for distributed inference mode via DeepSpeed
 
