Path Signature Pairs Trading
Filtering classic pairs trades by the shape of the price path using Lévy area.
Read MoreData @ Fastenal
Streaming platforms, event-driven systems, and cloud automation at Fastenal.
Privacy-first AI tools and developer products — used by real people.
Professional GIS tool for generating and visualizing bounding boxes with coordinate system support. An enhanced, feature-rich alternative to bboxfinder.com used by GIS professionals, data engineers, and developers working with geospatial data. Completely free with no sign-ups required.
Key Features: Draw rectangles, circles, and polygons with instant coordinate output • Support for multiple projections via EPSG codes • Import/export WKT, GeoJSON, and raw bbox coordinates • Toggle between Long/Lat and Lat/Long ordering • GDAL-friendly format output • Integrated search bar and satellite view options
A free, privacy-first Chrome extension providing instant context for selected text. Features intelligent multi-source search (Wikipedia, Wiktionary, Dictionary), on-device AI for summaries and translation (20+ languages), and a customizable draggable interface with dark mode. Zero tracking, ads, or data collection.
An interactive demographic simulator that models the global lottery of birth using the UN World, IMF and other data sources. Spin a weighted wheel to draw a random demographic persona based on current global birth distributions, and explore how your life expectancy, income, education, and freedoms compare across different countries and cohorts. Great for gaining perspective about your own life!
A specialized tool for GeoGuessr players to supercharge their Overpass queries. Helps players find specific locations based on map features, tags, and geographical data.
Engineered a fault-tolerant system demonstrating resilience and automation in hyperscale environments. Built with containerized microservices communicating via Redis Streams, implementing anomaly detection using scikit-learn's Isolation Forest to automate system recovery and ensure high availability.
Collection of production-ready design patterns and best practices for Apache Kafka Streams applications. Demonstrates real-world stream processing patterns including stateful operations, windowing, joins, and error handling strategies for building robust event-driven architectures.
Developed a novel quasi-Euclidean distance metric for calculating image similarity and evaluating its effectiveness against standard metrics like SSIM and Euclidean distance. Research paper "Determining Image similarity with Quasi-Euclidean Metric" published and peer-reviewed, available on ArXiv. Full implementation open-sourced demonstrating practical applications in computer vision and image processing.
Feature-rich, production-quality Discord bot with full-stack architecture. Includes data persistence, separate Express.js dashboard for configuration, and scheduled task execution via node-cron. Demonstrates end-to-end product ownership from development to deployment.
End-to-end data pipeline consuming historical quotes (Alphavantage API) and tweets (Twitter API) with sentiment analysis using TextBlob. Data stored in MongoDB and exported to AWS S3. Trained SparkML model exposed via REST API to predict future stock close prices for trading decisions.
Inspired by 2018 disasters in Uttarakhand and Kerala. Remodeled the state-of-the-art MaskRCNN algorithm to detect humans in flood-affected areas and evaluated real-world video performance for disaster response applications.
Backend team member for Mercury, a telemetry system developed as the main objective of an NYU graduate course. Designed and implemented scalable backend infrastructure for real-time telemetry data collection and processing.
Filtering classic pairs trades by the shape of the price path using Lévy area.
Read MoreA deep dive into building a browser extension for on-page lookups.
Read MoreExploring spatial data through interactive maps.
Read MoreSenior Engineer with 5+ years of expertise in designing, implementing, and managing scalable cloud infrastructure and automation platforms. Currently working as a Data Engineer at Fastenal, specializing in event-driven architectures and distributed systems.
Key Achievements:
- Architected Kafka-as-a-Managed-Service platform serving 15+ technology teams
- Maintained 99.9% uptime for critical event streams through proactive management
- Reduced integration time by 60% using Kafka Connect and Oracle GoldenGate
- Achieved 95% reduction in issue resolution time through Azure DevOps automation
- Manage ~50 RHEL VMs with Puppet, reducing cluster load by 60%
Outside of work, I'm passionate about cybersecurity—staying on top of breaches, hacks, data dumps, and new vulnerabilities, especially targeting Windows OS. I also enjoy playing CSGO, watching Chelsea, and photography. Currently learning on-device LLMs and exploring the intersection of AI and privacy.
Cloud & Infrastructure:
Infrastructure as Code & CI/CD:
Programming & Scripting:
Data Engineering & Databases:
Monitoring & Observability:
ML & Data Science:
Masters in Computer Science
New York University, New York
2019 - 2021
Interested in collaborating or just want to say hi?
Email: [email protected]
© 2026 Vibhor Singh. Built with and .