Data Scientist (Natural Language Processing)

  • Remote, Position
  • Full-time

WE’RE LOOKING FOR A PASSIONATE DATA SCIENTIST – ONE WHO KNOWS THAT DATA TELLS THE HISTORY OF REALITY, AND CAN BE USED TO PREDICT THE FUTURE. DID YOU STUDY DATA SCIENCE BECAUSE YOU FEEL THAT IT’S THE CLOSEST YOU CAN GET TO BEING A WIZARD WITH A CRYSTAL BALL? ARE YOU SO MEAN WHEN IT COMES TO ACCURATE DATA HANDLING THAT YOUR STANDARD DEVIATION IS ZERO?

 

IF SO, WE MIGHT JUST BE THE PERSON WE’RE LOOKING FOR. YOU’LL BE WORKING (REMOTELY) WITH A STARTUP THAT IS BUILDING SOFTWARE TO REVOLUTIONIZE THE WORLD OF ONLINE HIRING, THROUGH THE USE OF CUTTING-EDGE TECHNOLOGY IN AUTOMATION AND ARTIFICIAL INTELLIGENCE. THEY’VE BUILT A TEAM OF EXPERTS IN THOSE FIELDS AND NOW THEY ARE LOOKING FOR THE MISSING LINK. THAT’S WHERE YOU COME IN, WORKING WITH THEM TO FURTHER REFINE AND IMPROVE THEIR SOFTWARE.

 

HERE’S WHAT YOU’LL BE DOING:

  • DESIGN AND BUILD MACHINE LEARNING MODELS AND ALGORITHMS.
  • MEASURE AND IMPROVE THE PRECISION ACCURACY OF THE RANKING MODEL.
  • RECOMMEND NLP BASED TOOLS FOR FEATURE EXTRACTION FROM TEXTUAL DATA.
  • IMPROVE AND ENHANCE OUR NLP BASED ETL FEATURE EXTRACTION AND DATA MINING STRATEGY.
  • FEATURE SELECTION AND DIMENSION REDUCTION.
  • DEPLOY SUPERVISED AND UNSUPERVISED TECHNIQUES TO EXTRACT KEY INSIGHTS ON A JOB APPLICANT.
  • BUILD A REAL-TIME PREDICTIVE SYSTEM.

 

ONE DOES NOT SIMPLY WALK INTO DATA SCIENCE. YOU’LL NEED:

 

  • MASTERS OR DOCTORATE DEGREE IN THE FIELD DATA SCIENCE.
  • 3 TO 10 YEARS OF EXPERIENCE IN THE FIELD.
  • STRONG BACKGROUND IN ALGORITHMS AND DATA STRUCTURES.
  • EXPERIENCE IN MACHINE LEARNING AND STATISTICS TOOLS AND TECHNIQUES.
  • EXPERIENCE IN PYTHON AND SCALA.
  • EXPERIENCE WITH BIG DATA ML TOOLKITS SUCH AS SPARKML AND AZUREML.
  • PROVEN TRACK RECORD IN BUILDING PREDICTIVE MODELS ON SMALL AND LARGE DATASETS.
  • EXPERIENCE IN NLP AND TEXT MINING (NAMED-ENTITY RECOGNITION, SENTIMENT ANALYSIS).
  • STRONG COMMUNICATION, SELF-ORGANIZATION, AND LEADERSHIP SKILLS.
  • TO BE ABLE TO COMMUNICATE IN ENGLISH EFFICIENTLY AND EFFECTIVELY WITH THE CLIENTS, TEAM AND MANAGEMENT.

 

AND YOUR CHANCES WILL INCREASE BY 0.5 STANDARD DEVIATIONS (OR SO) IF YOU HAVE:

 

  • EXPERIENCE WITH NOSQL DATABASES SUCH AS MONGODB.
  • EXPERIENCE SETUP UP A SPARK OR HADOOP CLUSTER ON CLOUDERA OR SIMILAR OPEN-SOURCE BIG DATA FRAMEWORKS.
  • FAMILIARITY WITH AZURE CLOUD COMPUTING.
  • HANDS ON EXPERIENCE PROGRAMMING IN JAVA.

 

SOUNDS LIKE A FIT FOR YOU? SEND US YOUR CV, AND PLEASE INCLUDE THE ANSWERS TO THE FOLLOWING QUESTIONS:

  1. WHY DO YOU THINK YOU ARE A GOOD FIT FOR THIS PARTICULAR PROJECT?
  2. EVER WORKED ON AZUREML OR SPARKML?
  3. DO YOU HAVE ANY EXPERIENCE IN NLP AND N-GRAM EXTRACTION?
  4. HOW MANY PREDICTION MODELS HAVE YOU BUILT? WHAT WAS THE SMALLEST AND LARGEST DATA SET YOU’VE WORKED ON?
  5. DO YOU HAVE A MASTER’S OR PHD IN DATA SCIENCE?
  6. ARE YOU PROFICIENT IN ENGLISH?

GOOD LUCK!