1
Feature Story
Add support for Qwen2VL by HimariO · Pull Request #10361 · ggerganov/llama.cpp
Dec 15, 2024 · github.comThe failures were consistent across both f32 and f16 data types, particularly in modes 8 and 24, suggesting that these modes may require further investigation or optimization. Despite these failures, many configurations, especially those with adjusted parameters such as af=1.424500, passed successfully, indicating that certain adjustments can mitigate the issues. Overall, the data suggests that while the Metal backend performs well in many scenarios, specific configurations still present challenges that need addressing to ensure robust performance across all ROPE operations.
Key takeaways
- Some ROPE tests are failing when using the Metal backend, particularly in specific configurations.
- Failures are indicated by high NMSE values, exceeding the threshold of 0.000000100, in certain test cases.
- Tests with different configurations, such as varying dimensions and modes, show a mix of pass and fail results.
- Both f32 and f16 types are tested, with some failures occurring in both types under similar conditions.