Hi, I'm Mayank Arora

I'm a SWE at HubSpot

Projects

Algovis

Wrote a Python library that makes it easier to analyse and visualize sorting and searching algorithms.

Machine Learning-based Intrusion Detection for Virtual Infrastructures

For my master’s dissertation, I worked on how AI can be used to detect unusual behaviour or intrusions in virtual infrastructure such as Kubernetes. Created and deployed a microservices architecture that captured packets, processed them, scored them using an ML model and displayed the results on a dashboard. This was deployed locally on minikube and on Azure Kubernetes Service. Intrusions on the deployed WordPress website were simulated using WPScan and a password dictionary. ML/DL techniques such as IsolationForest, DBSCAN and autoencoders were analysed for this project.

Distributed Systems Project

Designed, implemented and deployed a microservices-based architecture to test athletes for doping. Created the user profiles, login infrastructure and authentication. Helped the team learn Docker and Kubernetes, managed the deployment on Azure Kubernetes Service and ran load tests using BlazeMeter.

Artificial Intelligence Project

Created a bot that uses BFS/DFS, Minimax, Q-learning or a neural net approach to play Connect-4. Helped create an arena where bots were pitted against each other to analyse their overall performance.
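
To give a flavour of the Minimax approach, here is a minimal sketch, not the project's actual code. It assumes a 6x7 board, players numbered 1 and 2, and a depth-limited search that scores a position as +1 (win), -1 (loss) or 0 (undecided at the cut-off). All function names here (new_board, drop, winner, minimax, best_move) are hypothetical and chosen only for illustration.

```python
from typing import List, Optional

ROWS, COLS = 6, 7
EMPTY = 0

Board = List[List[int]]  # board[row][col]; row 0 is the top row


def new_board() -> Board:
    return [[EMPTY] * COLS for _ in range(ROWS)]


def valid_moves(board: Board) -> List[int]:
    # A column is playable while its top cell is still empty.
    return [c for c in range(COLS) if board[0][c] == EMPTY]


def drop(board: Board, col: int, player: int) -> Board:
    # Return a copy of the board with the player's disc dropped into col.
    nxt = [row[:] for row in board]
    for r in range(ROWS - 1, -1, -1):
        if nxt[r][col] == EMPTY:
            nxt[r][col] = player
            return nxt
    raise ValueError("column is full")


def winner(board: Board) -> int:
    # Scan every disc in four directions for a run of four equal discs.
    for r in range(ROWS):
        for c in range(COLS):
            p = board[r][c]
            if p == EMPTY:
                continue
            for dr, dc in ((0, 1), (1, 0), (1, 1), (1, -1)):
                end_r, end_c = r + 3 * dr, c + 3 * dc
                if 0 <= end_r < ROWS and 0 <= end_c < COLS and all(
                    board[r + i * dr][c + i * dc] == p for i in range(1, 4)
                ):
                    return p
    return EMPTY


def minimax(board: Board, depth: int, to_move: int, me: int) -> int:
    # Score from the point of view of `me`: +1 win, -1 loss, 0 otherwise.
    won = winner(board)
    if won != EMPTY:
        return 1 if won == me else -1
    moves = valid_moves(board)
    if depth == 0 or not moves:
        return 0  # assumed trivial evaluation at the search horizon
    other = 3 - to_move
    scores = [minimax(drop(board, c, to_move), depth - 1, other, me) for c in moves]
    # `me` picks the best score; the opponent picks the worst one for `me`.
    return max(scores) if to_move == me else min(scores)


def best_move(board: Board, player: int, depth: int = 4) -> Optional[int]:
    # Try every playable column and keep the one with the highest score.
    moves = valid_moves(board)
    if not moves:
        return None
    return max(
        moves,
        key=lambda c: minimax(drop(board, c, player), depth - 1, 3 - player, player),
    )
```

Calling best_move(board, 1) returns the column the sketch would play for player 1. A stronger bot would replace the zero score at the cut-off with a heuristic evaluation and add alpha-beta pruning.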

Information Retrieval Project

Led a team of four students to build a search engine with Apache Lucene and Java, based on the TREC 2004 Robust Track. Our team placed 2nd out of 15 teams.

WhatsApp Analyser

Ever wanted to make a cool word cloud out of your WhatsApp chats and pinpoint the moment you stopped texting your best friend? (Works for both individual and group chats.)

Experience

Co-op @ Email Analytics

Dublin, Ireland (Remote) from February 2021 to September 2021

  • Team: Part of the email analytics team that helped HubSpot’s customers increase email engagement with analytics. This team managed the system that processed, stored and indexed 15 billion events per month (as of August 2021).
  • Tests: Wrote unit, integration and acceptance tests for various components of the system, increasing its robustness and reliability
  • Bounce project: Worked on an independent data analysis project that helped HubSpot with calculating the percentage of misclassified email bounce types and evaluating the scope of the problem
  • Splitting Kafka worker: Split the Kafka worker through which all of HubSpot’s 15 billion monthly email events flowed into two workers. Added new metrics, wrote the acceptance tests and deployed the workers in QA and Production
  • Internal tooling: Created a universal internal endpoint that handles all event tracking URLs and returns their metadata, reducing developer time
  • JIRAs: Investigated and resolved multiple internal and customer-facing Jira issues
  • Metrics: Added alerts and metrics, and created charts in SignalFx to monitor the health of the system

Programming Skills

Skills

Git, Data Analysis, Linux Shell Scripting, CI/CD, Kubernetes

Languages

Python, Java, C, C++, SQL, HTML, CSS, JavaScript, Bash

Frameworks

Flask, Dropwizard

About Me

I hold a master's degree in computer engineering from Trinity College, Dublin. I like to build things using code and play tennis.