Zero-shot classification of art with large language models
- Date:
- February 25, 2025
- Source:
- University of Tsukuba
- Summary:
- Traditional machine learning models for automatic information classification require retraining data for each task. Researchers have demonstrated that art data can be automatically classified with sufficient accuracy by using a large language model (LLM), without requiring additional training data.
- Share:
Art has emerged as a significant investment asset. This has led to growing interest in art price prediction as a tool for assessing potential returns and risks. However, organizing and annotating the data required for price prediction is challenging due to the substantial human costs and time involved. To address this, researchers applied a technique known as "zero-shot classification," which leverages a large language model (LLM) to classify data without the need for pre-prepared training data.
The research team explored the feasibility of automatically determining artwork types -- such as paintings, prints, sculptures, and photographs -- by optimizing the LLM "Llama-3 70B," an open model, to a 4-bit format. The results confirmed that the model classified artwork types with an accuracy exceeding 90%. Furthermore, when compared to OpenAI's GPT-4o generative AI, it achieved slightly higher accuracy.
This approach enables performance comparable to conventional machine learning methods while notably reducing the human effort and time required for data organization. These results could enhance accessibility to art analyses and price evaluation, expanding opportunities not only for investment but also for research and appreciation.
Story Source:
Materials provided by University of Tsukuba. Note: Content may be edited for style and length.
Journal Reference:
- Tatsuya Tojima, Mitsuo Yoshida. Zero-Shot Classification of Art With Large Language Models. IEEE Access, 2025; 13: 17426 DOI: 10.1109/ACCESS.2025.3532995
Cite This Page: