I recently had the opportunity to present at a Computer Vision Hangout, sharing my journey from autonomous drone competition to fine-tuning Vision-Language Models.
I built an interactive presentation app! Here's a glimpse of the topics:
🚁 Black Bee Drones: My first steps into CV with Latin America's first autonomous drone team. Covering classical CV techniques (filtering, edge detection), the IMAV 2023 mission (ArUco detection, line following with PID control), and links to demos for OpenCV basics and PID simulation.
🤖 Asimo Foundation: Using MediaPipe for gesture control of a robotic arm in an educational project.
☕ CafeDL: Building a small Deep Learning framework from scratch in Java (inspired by Keras, using ND4J) and training a CNN for a QuickDraw-like app.
🏢 Tech4Humans: Real-world applications, including open-source signature detection and efficient fine-tuning of VLMs for document extraction.
Check out the interactive demos (also embedded in the main app):
Sharing my new article on an open-source project for automated signature detection in document processing. The article details:
- Dataset Engineering: Combining two public collections to create a hybrid dataset. - Architecture Benchmarking: Evaluating sota models such as YOLO series, DETR variants, and YOLOS for accuracy and efficiency. - Model Optimization: Using Optuna for hyperparameter tuning, achieving a 7.94% F1-score improvement. - Production Deployment: Implementing Triton Inference Server with an OpenVINO CPU backend for optimized inference.
It's not such a complex project, but I explore the training of the best current architectures for object detection and share all notebooks, data, models, and the repo with deployment and benchmarking details.
I wrote a article on Parameter-Efficient Fine-Tuning (PEFT), exploring techniques for efficient fine-tuning in LLMs, their implementations, and variations.
The study is based on the article "Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning" and the PEFT library integrated with Hugging Face's Transformers.