Movie Genre Prediction Based on Posters

Analysis of movie genre classification using convolutional neural networks and transfer learning

Abstract

This article presents a method for classifying movie genres using only movie posters as input. The approach combines image processing techniques with convolutional neural networks (CNN) and transfer learning to extract relevant visual features and perform multiclass classification. The model was trained on a dataset of over 45,000 movie posters, achieving over 85% accuracy in classifying major genres. Analysis of the convolutional layers reveals specific visual patterns associated with each genre, providing insights into the visual language of cinema.

1. Business Vision

Automatic movie genre classification has important applications in streaming, content recommendation, and market analysis. By using only posters — already available and standardized resources — the method reduces implementation costs and provides valuable insights into cinema's visual communication.

2. Data and Preprocessing

The dataset was built from multiple sources:

"We used the TMDB API to collect posters and metadata, combining with IMDb information for cross-validation of genres." (GitHub)

The distribution of movies by year can be seen below:

Distribution of movies by year

The initial distribution of genres showed significant imbalance:

Initial distribution of genres

After preprocessing and balancing, we achieved a more uniform distribution:

Final distribution of genres

3. Methodology

We tested two different architectures:

3.1 Custom Neural Network

A custom CNN architecture was designed specifically for this task:

Custom neural network architecture

The learning curve shows good convergence:

Custom network learning curve

The confusion matrix reveals the model's performance:

Custom network confusion matrix

3.2 VGG16 with Transfer Learning

We also implemented a VGG16-based model with transfer learning:

VGG16 confusion matrix

4. Applied Competencies

Computer Vision: Image processing and feature extraction
Deep Learning: CNN architecture design and training
Transfer Learning: Fine-tuning of pre-trained models
Data Engineering: Dataset construction and preprocessing
MLOps: Model deployment and API development

5. Results

Accuracy: 85% in classifying major genres
Inference Time: Less than 100ms per image
Visual Patterns: Identification of genre-specific visual elements
API: REST endpoint for real-time classification

6. Impact and Next Steps

The model can be integrated into streaming platforms and content management systems to:

Automate genre tagging
Improve content recommendations
Analyze visual trends in cinema
Support marketing decisions

Future work includes:

Expanding to subgenres
Incorporating temporal analysis
Developing a web interface
Adding explainability features

7. Conclusion

The proposed method demonstrates that movie posters contain sufficient visual information for genre classification. The combination of custom CNN and transfer learning approaches provides a robust solution that can be easily integrated into existing systems. The analysis of visual patterns also offers valuable insights for the film industry.