May 22, 2024, 8:27 a.m. | Francesco Mattia

DEV Community dev.to




Introduction


I am diving into the vision capabilities of large language models (LLMs) to see if they can accurately classify images, specifically focusing on spotting door handle positions to tell if they’re locked or unlocked. This experiment includes basic tests to evaluate accuracy, speed, and token usage, offering an initial comparison across models.


Code on GitHub






Scenario


Imagine using a webcam to monitor door security, providing images of door handles in different lighting conditions (day and night). The system’s goal …

accuracy basic can capabilities computervision door for home genai home home security images introduction language language models large llms locked security speed tests token unlocked

Information Technology Specialist I, LACERA: Information Security Engineer

@ Los Angeles County Employees Retirement Association (LACERA) | Pasadena, CA

Manager Pentest H/F

@ Hifield | Sèvres, France

Information System Security Officer

@ Parsons Corporation | USA VA Chantilly (Client Site)

Vulnerability Analyst, Mid

@ Booz Allen Hamilton | USA, VA, McLean (8283 Greensboro Dr, Hamilton)

SAP Security and Compliance Auditor

@ Bosch Group | Warszawa, Poland

Head of Product Security (Business team)

@ Zalando | Berlin