Movie Genre Prediction Based on Posters
Role: Researcher & Developer
Duration: 2024

Movie Genre Prediction Based on Posters
Analysis of movie genre classification using convolutional neural networks and transfer learning
Abstract
This article presents a method for classifying movie genres using only movie posters as input. The approach combines image processing techniques with convolutional neural networks (CNN) and transfer learning to extract relevant visual features and perform multiclass classification. The model was trained on a dataset of over 45,000 movie posters, achieving over 85% accuracy in classifying major genres. Analysis of the convolutional layers reveals specific visual patterns associated with each genre, providing insights into the visual language of cinema.
1. Business Vision
Automatic movie genre classification has important applications in streaming, content recommendation, and market analysis. By using only posters — already available and standardized resources — the method reduces implementation costs and provides valuable insights into cinema's visual communication.
2. Data and Preprocessing
The dataset was built from multiple sources:
"We used the TMDB API to collect posters and metadata, combining with IMDb information for cross-validation of genres." (GitHub)
The distribution of movies by year can be seen below:
The initial distribution of genres showed significant imbalance:
After preprocessing and balancing, we achieved a more uniform distribution:
3. Methodology
We tested two different architectures:
3.1 Custom Neural Network
A custom CNN architecture was designed specifically for this task:
The learning curve shows good convergence:
The confusion matrix reveals the model's performance:
3.2 VGG16 with Transfer Learning
We also implemented a VGG16-based model with transfer learning:
4. Applied Competencies
- Computer Vision: Image processing and feature extraction
- Deep Learning: CNN architecture design and training
- Transfer Learning: Fine-tuning of pre-trained models
- Data Engineering: Dataset construction and preprocessing
- MLOps: Model deployment and API development
5. Results
- Accuracy: 85% in classifying major genres
- Inference Time: Less than 100ms per image
- Visual Patterns: Identification of genre-specific visual elements
- API: REST endpoint for real-time classification
6. Impact and Next Steps
The model can be integrated into streaming platforms and content management systems to:
- Automate genre tagging
- Improve content recommendations
- Analyze visual trends in cinema
- Support marketing decisions
Future work includes:
- Expanding to subgenres
- Incorporating temporal analysis
- Developing a web interface
- Adding explainability features
7. Conclusion
The proposed method demonstrates that movie posters contain sufficient visual information for genre classification. The combination of custom CNN and transfer learning approaches provides a robust solution that can be easily integrated into existing systems. The analysis of visual patterns also offers valuable insights for the film industry.