AlphaSense logo


In office: New Delhi

  • 🇮🇳 India

Software Engineer II - Content Platform

About AlphaSense:

AlphaSense provides an AI-based search engine for market intelligence, used by the largest and fastest-growing firms globally. Our mission is to curate and semantically index the world’s market and company information, including the vast high-value content sets that traditional web search engines cannot reach. With 3500+ enterprise clients, AlphaSense helps knowledge professionals become dramatically more productive, and gain an information edge by discovering critical data points and trends that others miss.

Check out what we’ve built so far:

1. The decision that matters

2. India Office -

The Role:

We are seeking a skilled Software Engineer II to join our dynamic team responsible for building and maintaining data ingestion systems at scale. In this role, you will have the opportunity to work on challenging projects and collaborate with cross-functional teams to design and implement robust solutions for scraping millions of documents per month. Experience with web scraping systems is a plus but not required.

What You’ll Do:

  • Design, develop, and maintain scalable data ingestion pipelines to scrape large volumes of documents efficiently and reliably from various sources on the web.

  • Collaborate with team members and stakeholders to gather requirements, define technical specifications, and implement solutions that meet business needs.

  • Write clean, maintainable code and perform code reviews to ensure quality and adherence to coding standards.

  • Troubleshoot and debug issues in web scraping workflows, and implement solutions to optimize performance and reliability.

  • Stay up-to-date with emerging technologies and best practices in data engineering, and propose innovative solutions to enhance our data ingestion capabilities.

  • Contribute to the continuous improvement of our development processes and tools, and actively participate in team meetings and discussions.

Candidate Requirements:


  • Bachelor's or Master's degree in Computer Science, Engineering or a related field.

  • Minimum 2 years of experience in Software Development with 2+ years of proficiency in Python.

  • Good understanding of data structures, algorithms and computer science fundamentals.

  • Experience developing applications in Python frameworks (e.g. Django, Flask, FastAPI).

  • Experience in structured/unstructured data extraction through text processing, audio processing, regular expressions etc.

  • Excellent problem-solving skills and ability to work independently as well as collaboratively in a team environment.

  • Strong communication and interpersonal skills, with the ability to effectively collaborate with team members and stakeholders.

Nice to have

  • Proficiency in programming languages such as Java or GO, and experience with related frameworks and libraries for data processing and manipulation.

  • Knowledge of web scraping techniques and APIs for retrieving multimedia content from the public web is a plus.

  • Working knowledge of any cloud service provider(preferably AWS)

  • Familiar with IAC technologies such as Terraform and Crossplane.

  • Experience with workflow orchestration tools like Airflow, Prefect

  • Exposure NoSQL solutions like MongoDB, DynamoDB, etc.

  • Experience with working on Dockers, K8s