Skip to main content

Hi, I'm

Andrea Alberti

Building intelligent systems with AI

Machine Learning
Multi-Agent Systems
LLM Models
Deep Learning
RAG Systems

Scroll to explore

About Me

Who I Am

Andrea Alberti

Andrea Alberti

GenAI Engineer & Data Scientist

Graduated with a double degree in Management and Computer Science–Data Science, I have gained substantial experience in multidisciplinary projects. I specialize in applying machine learning, deep learning, and most recently Generative AI techniques to develop innovative and automated solutions.

My academic journey, from Management Engineering (110/110 cum laude) to a Master's in Data Science (110/110 cum laude), has given me a unique blend of technical expertise and strategic thinking. This allows me to approach complex problems from both a business and a technological perspective.

Professionally, I focus on building intelligent systems—from multi-agent architectures that automate complex business processes to advanced conversational AI—to drive efficiency and reduce operational costs. My goal is to continuously improve myself, using AI to create tangible value.

Download Full CV
2022 - 2024110/110 L

MSc. Data Science - Computer Engineering

University of Pavia

Download Thesis
2018 - 2021110/110 L

BSc. Management Engineering

University of Brescia

Download Thesis

Certification

Google Cloud Professional ML Engineer

Google Cloud

Verify

Technical Skills

Core Skills

RAG
Multi-Agent Systems
Prompt Engineering
Few-Shot Learning
Problem Solving
System Design
Data Analysis
Research
Public Speaking
Presentation Design
Workshop Creation

AI Tools

Claude Code
Codex
GitHub Copilot
Gemini
ChatGPT
Claude
NotebookLM
Gemini CLI

Cloud & Infrastructure

Google Cloud Platform
Vertex AI
Cloud Run
Cloud Functions
BigQuery
WebSocket
Agent Engine

Programming Languages

Python
SQL
HTML/CSS

Frameworks & Libraries

Google ADK
Dialogflow CX
Gemini SDK
Vertex AI Search
TensorFlow
PyTorch
Scikit-learn
Keras
XGBoost
FastAPI
Flask
PySpark
Hadoop
MongoDB
NetworkX
Pandas
NumPy
OpenCV
Matplotlib
Professional Work

Professional Projects

A selection of professional projects where I applied Generative AI and Machine Learning to solve real-world problems.

LiveFeatured

AI-Powered Tyre Selection Assistant

Refined and tested a Dialogflow CX conversational agent for the UK market, guiding users in tyre selection through a multi-agent system with RAG and external APIs. Successfully migrated the solution to Google Agent Development Kit (ADK), expanding to international markets and new vehicle categories with cross-cloud AWS-GCP integration.

Dialogflow CXGoogle ADKMulti-AgentRAG
Multi
Markets
Cross-Cloud
Architecture
Expanded
Categories
CompletedFeatured

Automated Document Validation System

Full-stack implementation of a multi-agent system to automate verification and validation of documents for public funding requests. Developed backend in Python and frontend in HTML, CSS, and JavaScript. Designed a modular and flexible architecture that allows agent behavior adaptation without code modification.

Multi-AgentPythonProcess AutomationGCP
90%
Time Reduction
Modular
Architecture
Full
Stack
Completed

Advanced Knowledge Base Chatbot

Implementation of advanced chatbot agent architecture to answer user questions using a knowledge base of web pages and PDF documents. System designed with a main orchestrator agent routing requests to five specialized sub-agents. Developed data management pipeline including PDF parsing, chunking strategy, and security layer for inappropriate questions.

Dialogflow CXGCPMulti-AgentRAG
5
Sub-Agents
Multi
Domains
PDF+Web
Documents
Completed

Real-Time Multimodal Agent

Development and implementation of real-time multimodal conversational agent based on Google Agent Development Kit (ADK). System capable of processing audio and video inputs simultaneously, sustaining fluid conversations, and autonomously performing browser operations through reasoning and tools. Asynchronous dual-server architecture with WebSocket communication for low-latency bidirectional streaming.

Google ADKGemini Live APIMultimodal AIBrowser Automation
<500ms
Latency
A+V
Modalities
Async
Architecture
Completed

Luxury Yacht Virtual Assistant

Implementation of virtual assistant with aim of answering user questions based on corporate knowledge base. Configured as RAG (Retrieval-Augmented Generation) application, leveraging Google Cloud ADK and Vertex AI Search to retrieve relevant information from dedicated datastore. Architecture uses Gemini models to process requests and generate accurate, contextualized responses in Italian.

RAGVertex AI SearchGoogle ADKGCP
ITA
Language
95%
Accuracy
RAG
Technology
Completed

Multi-Agent Ticketing System

Implementation of multi-agent ticketing system to automate user responses. Project developed as Proof of Concept (POC), involving creation of specialized agents, each capable of interacting with external databases via APIs to provide accurate and comprehensive replies. Multi-agent architecture ensures user requests are routed to most competent agent, reducing staff workload and management costs.

Multi-AgentGoogle ADKAPI IntegrationAutomation
-70%
Workload
Multi
Agents
POC
Type
Completed

LLM-Based Email Classification Pipeline

Within broader project aimed at automating manual process and reducing operational costs, contribution focused on creating LLM-based pipeline for classification and dispatch of certified emails (PEC). Key activities included prompt engineering and few-shot learning to refine model outputs, along with development of metrics and analytical tools for system performance evaluation.

LLMPrompt EngineeringFew-Shot LearningProcess Automation
High
Cost Reduction
Few-Shot
Technique
85%
Automation
Completed

Insurance Liquidation Engine - Demo Project

Developed comprehensive multi-agent collaborative architecture for internal Sharing Days demonstration. System automatically analyzes, processes, and issues liquidation judgments on insurance claims. Architecture combines parallel agents for document analysis with sequential agents for final evaluation, utilizing custom tools, built-in Google Cloud services, and MCP integration.

Google ADKMulti-AgentCloud RunMCP
3
Agent Types
Hybrid
Architecture
Demo
Purpose
Academic Research

Research & Academic Work

8 papers
80+ pages
6 cum laude
Heart Disease Detection from Audio Signals
Machine Learning

Heart Disease Detection from Audio Signals

Advanced Biomedical Machine Learning

This study presents two machine learning models (MLP_Ensemble5 and MLP_Ensemble2) aimed at enhancing heart disease detection from heart sound recordings using ensemble techniques. The project involved data preprocessing, feature extraction (MFCCs, Chroma STFT, etc.), and feature selection using data from the PASCAL CHSC2011 challenge.

Key Results
0.82
F1-Score
0.96
ROC-AUC
81.53%
MCC
43.4%
TPR at 1% FPR
41 (from 338)
Features
Jul 2024
17 pages
Scikit-learnTorchaudioLibrosa
Code
Disease Prediction with Graph Machine Learning
Graph ML

Disease Prediction with Graph Machine Learning

Financial Data Science

This study investigated the complex relationships between symptoms and diseases using network analysis techniques on a large dataset. A bipartite graph was created and analyzed using metrics like degree distribution, Hidalgo's method of reflections, betweenness centrality, and community detection. Novel features derived from these network metrics were used to train predictive models (Logistic Regression, Random Forest, MLP).

Key Results
87%
Accuracy
28%
Feature Reduction
1.5%
Accuracy Drop
9.4%
Training Time Reduction
Dec 2023
23 pages
PythonNetworkXScikit-learn
Code
Review Helpfulness Prediction with Big Data
Big Data

Review Helpfulness Prediction with Big Data

Data Science & Big Data Analytics

This project involved a comprehensive analysis of an Amazon book review dataset using big data tools (Hadoop, Spark) and data science techniques. It investigated factors influencing review helpfulness, such as review length, sentiment, and user rating. Predictive models (Random Forest, SVR, MLP) were built using Word2Vec embeddings to estimate review helpfulness.

Key Results
Random Forest
Best Model
0.026
MSE
0.25
Sep 2023
6 pages
Hadoop HDFSPySparkSpark MLlib
Code
Clickbait Detection in News Headlines
NLP

Clickbait Detection in News Headlines

Machine Learning

Implemented and compared Multinomial Naïve Bayes (MNB) and Logistic Regression (LR) classifiers for identifying clickbait headlines from a dataset of 32,000 examples. The project evaluated performance under two scenarios: maximizing overall accuracy and minimizing the False Positive Rate (FPR). Utilized Bag-of-Words feature representation with different vocabulary sizes.

Key Results
97.12%
Accuracy
0.0%
Best FPR
84.00%
FPR Accuracy
8000 words
Vocabulary
Feb 2024
6 pages
PythonScikit-learnNumPy
Code
DDoS Attack Detection and Mitigation
Security

DDoS Attack Detection and Mitigation

Enterprise Digital Infrastructure

This project experimentally assessed the impact of DNS reflection and amplification attacks within a controlled local network environment. It explored how different DNS request types affect amplification factors and analyzed the consequences on the target system's latency and the DNS server's resource utilization (CPU, RAM). Attacks were simulated using custom Scapy scripts with IP address spoofing.

Key Results
1.46
AF A
4.14
AF MX
4.46
AF NS
Jan 2024
11 pages
PythonScapyWireshark
Code
Cake Classification Features Analysis
Machine Learning

Cake Classification Features Analysis

Machine Learning

This project developed models to classify images into 15 different cake categories. It compared two main approaches: using a Multi-Layer Perceptron (MLP) with low-level image features (Color Histogram, Edge Direction Histogram, Co-occurrence Matrix), and using an MLP with neural features extracted from a pre-trained CNN (PVMLNet). Transfer learning was also explored by adapting the pre-trained CNN.

Key Results
90%
Accuracy
90%
Neural Features
31%
Low-Level Features
80%
Transfer Learning
Feb 2024
5 pages
Pythonpvml libraryPVMLNet
Code
Vanishing Points Detection in Images
Computer Vision

Vanishing Points Detection in Images

Computer Vision

Developed two image processing programs. The first focuses on image binarization using a custom histogram-based thresholding technique, offering automatic and manual tuning. The second program detects vanishing points and lines using techniques like the Canny edge detector, probabilistic Hough transform, and the RANSAC algorithm. Both programs include command-line interfaces.

Key Results
500
RANSAC Iterations
5 pixels
Threshold
10 longest per run
Hough Lines
Mar 2024
7 pages
Python 3.9+OpenCVNumPy
Code
Sentiment Analysis on Social Media
NLP

Sentiment Analysis on Social Media

Machine Learning

This project developed and compared Multinomial Naive Bayes (MNB) and Logistic Regression (LR) classifiers for predicting sentiment (positive/negative) in movie reviews, using the dataset from Maas et al. The study explored the impact of vocabulary size and preprocessing techniques like stopword removal and stemming on classifier performance. Analysis included evaluating accuracy, overfitting trends, and identifying the most impactful words.

Key Results
85.4%
Accuracy
82.6%
MNB Accuracy
85.4%
LR Accuracy
1000 words
Best Vocabulary
Feb 2024
5 pages
PythonScikit-learnNLTK
Code
AI Playground

Interactive AI Demos

Explore cutting-edge AI capabilities through interactive demonstrations. From RAG-powered research tools to autonomous multi-agent systems.

Research Paper Explorer

Chat with my academic papers using RAG. Ask questions about my research in ML, DL, NLP, and Computer Vision.

RAGLangChainVector DBGemini

AI Board of Directors

Multi-agent system simulating a board of expert advisors. Watch agents debate and reach consensus on strategic decisions.

Multi-AgentCrewAIAutoGenCollaboration

Autonomous Research Assistant

AI agent with tools for web search, data analysis, and report generation. Demonstrates agentic workflows and tool use.

AgentToolsWeb SearchAnalysis
My Passions

Beyond Work

Other things I love to do

Active Body, Active Mind
Doing Sports

Active Body, Active Mind

It is important for me to have moments to take care of my health. I set up a small home gym and I also love playing tennis and football.

The Perfect Mix of Passion, Ability, and Strategy
Watching Formula 1

The Perfect Mix of Passion, Ability, and Strategy

I love Formula 1 because it represents the pinnacle of racing, combining cutting-edge technology with human skill and strategic thinking.

If you no longer go for a gap which exists, you are no longer a racing driver. — Ayrton Senna

A Long-Lasting Family Tradition
Watching Football

A Long-Lasting Family Tradition

I inherited from my grandfather the passion for football. I love watching and analyzing matches and I'm a big fan of AC Milan since ever.

Get in Touch

Let's Connect

Open to opportunities and collaborations. Feel free to reach out!

Let's Connect

Open to opportunities and collaborations

LinkedIn
Andrea Alberti
GitHub
@AndreaAlberti07
Response Time
Usually within 24 hours
Andrea Alberti | GenAI Engineer & AI Systems Architect