Data Engineer

To apply for this role, please read the job description below and send your resume to with the subject line as “Data Engineer vacancy”.

Why Reverie exists:

Today we make majority of our fellow citizens who can’t speak English but are already online feel unwelcome on the Internet. Whenever they visit your site or app, they are met with a wall of text in a language that they can’t understand.

We at Reverie believe in Language Equality on the InternetTM. This means that every non-English speaker, should enjoy as native and organic an experience online as you and I take for granted. We are working towards this mission by building the full-stack of language technologies spanning fonts, font rendering, transliteration (when you write the same word in a different script), translation, language apps, and multilingual search for web portals and mobile apps.

What Reverie does:

Reverie helps you connect beyond India’s 10% English users. We understand your business context to provide fast and scalable solutions via our Language-as-a-ServiceTM (LaaSTM) platform. We take immense pride in the fact that we tackle the most complex and impactful problems in computing today. To that end, we’ve worked super hard to assemble a team of experts across machine learning, NLP, linguistics, data science, machine translation and multilingual search. Now, localisation is just an API away.

Required skills for this role:

  • Build a robust ETL pipeline to manage data from disparate of data sources
  • Build and scale our backend data stores and compute engines to process large quantities of data
  • Create text corpora of multilingual data, and tools to process them in a variety of ways
  • Build a low latency serving layer that powers our dashboards, reports, and other analytics functionality
  • Build an analytics pipeline to serve actionable insights and recommendations to our customers
  • Be passionate about writing code. Have experience coding in multiple languages, including at least one scripting language. Be able to argue convincingly why feature X of language Y rocks/sucks, and so on
  • Have some personal projects that you work on during your spare time. Show off projects you have hosted on Github
  • Use the command line like a pro. Be proficient in regular expressions, Xpath and other libraries for parsing and extracting unstructured data
  • Have built RESTful APIs; addressed challenges of scale and performance
  • Have exposure to large­ scale computational models such as MapReduce and Spark
  • Have experience using one or more storage and indexing technologies such as MongoDB, Cassandra, Solr, Elastic
  • Build a culture of data and statistics driven thinking in everything we do
  • Be a generalist who has the ability to pick up any of these over a weekend and get to work on the Monday next
  • Be a self-­starter, someone who thrives in environments with minimal “management”

Bonus Points:

  • Have exposure to analytics tools and libraries like R and Pandas
  • Have a background in machine learning

What we DON’T care about:

Your age, gender, where you went to college, or your academic scores.

What we DO care about:

  • Our mission resonates with you
  • You meet most of the requirements listed above
  • You have an insatiable curiosity, which means you’ll figure a way out even in an unfamiliar environment
  • And finally your integrity and work ethic

To apply for this role, please read the job description below and send your resume to with the subject line as “Data Engineer vacancy”.