Data Engineer in Norfolk, VA at Vaco

Date Posted: 10/10/2019

Job Snapshot

Job Description

Vaco is seeking a Data Engineer to join an industry leader in the retail space.

We are searching for a talented Data Engineer, who's passionate about and capable of developing, maintaining, testing, and evaluating big data solutions and technologies in support of several classified advertising businesses. You will work on implementing big data projects with a focus on collecting, parsing, managing, analyzing and helping data scientists visualize large sets of data to help turn information into insights using multiple platforms.

Responsibilities include:

  • Gather and process raw data at scale (including writing scripts, web scraping, calling APIs, writing SQL queries, etc.)
  • Work closely with our engineering team to integrate new innovations and algorithms into our production systems.
  • Process unstructured data into a form suitable for analysis - and then help do the analysis.
  • Support business decisions with ad hoc analysis as needed.
  • We utilize AWS, so experience with EMR and other web services will help you hit the ground running.
  • Research and assess the viability of new processing and data storage technologies

Qualifications:

  • Bachelor's or Master's degree in computer science or software engineering or equivalent experience in addition to several years experience in an engineering position supporting high traffic and high volume data processing products
  • Strong knowledge of and experience with statistics, potentially other advanced math as well.
  • Programming experience in Python required
  • Deep knowledge in data mining, machine learning, natural language processing, or information retrieval.
  • Also experience developing prototypes and proof of concepts for proposed solutions is a must.

Preferred:

  • Experience in Node.js or Javascript frameworks
  • Experience processing large amounts of structured and unstructured data. Map Reduce experience.
  • Experience with some or all of the following: Amazon Web Services, Analytics, Big Data, Chef, Distributed Systems, Hadoop, MongoDB, PHP, Lucene Solr, ELK.