HuggingFace Significant Daily Papers
Microsoft Phi-Ground-Any: A 4B Vision Model for Precise GUI Grounding
Microsoft released Phi-Ground-Any, a 4B-parameter vision-language model for GUI grounding, on Hugging Face. The model achieves state-of-the-art results on the ScreenSpot-Pro and UI-Vision benchmarks, enabling precise screen e…