Aliabbas P.

Lead Data Scientist

Bengaluru, India

Experience: 10 Years

190000 USD / Year

Availability: Immediate


About Me

I have deep experience in NLP and in the publishing, e-commerce, and healthcare domains, and I have been working on technology projects for more than 10 years.

I have successfully completed 160+ real-world projects in the following areas:

· Healthcare natural language processing and analytics

· Automated NLP-based proofreading and grammar correction systems

· Deep learning: problem formulation, custom model development, and evaluation of models

· Big data technologies (Hadoop, Spark, graph databases)

· AI-based automation of business processes

· Business forecasting

· Data visualization: all kinds of advanced data visualizations using Tableau, Bokeh, Matplotlib

· High-accuracy speech recognition using state-of-the-art deep learning models, with full ownership

· Education technology: guidance on the use of technology in large-scale, results-driven training settings

· Statistical and rule-based natural language processing: language models, natural language understanding, automatic text generation, and summarization

· Thesis/paper review: helping Ph.D. and master's students write an effective thesis, including detailed review, comments, and guidance on tools to get quality work done

Portfolio Projects

Description

I worked as a Lead NLP and data science consultant at a top editing company in India that caters to the STM research industry. The company offers English editing, translation, and transcription services to researchers, corporations, and pharmaceutical companies worldwide. I was tasked with developing an NLP-based product to automate the proofreading and grammar correction work done by the editors. I worked with highly experienced linguists, doctorate holders, and management to design and develop a framework that automates the various editing tasks on English manuscripts. I developed a hybrid machine learning and rule-based system that automatically corrects English manuscripts written by non-native authors from Japan, China, and other Asian countries.

I also designed a special-purpose rule language and trained the linguists to use it to write grammar correction rules. English grammar categories such as articles and prepositions were handled by a supervised machine learning system that I developed, which covered 66% of the cases. Apache Spark was used to improve performance by parallelizing the editing system.

Description

Lazada, based in Singapore, is the largest e-commerce service in Southeast Asia. My charter at Lazada is to develop competitive market intelligence capabilities that allow us to stay ahead of the competition. As part of the market intelligence suite, I developed seller mapping modules that identify the same sellers across multiple competitors. This capability allowed us to find sellers who were not selling on our platform, as well as sellers on our platform offering a narrower assortment than they offered on a competitor. We were able to cover close to 39% of sellers in the Indonesian market. Similarly, I developed SKU-level competitor product matching modules for the RedMart channel in Lazada, with a precision of around 97% and near-complete coverage.

Description

I worked as a Lead Data Scientist in an R&D lab of [Confidential Company] healthcare UK, a clinical documentation company, on its internal Quality Improvement project. For a clinical documentation company, it is of paramount importance to know the quality of its medical transcribers (MTs) and proofers (PRs). The problem we observed was that the errors committed by MTs were not being systematically marked, so there was no data-driven way to measure their quality. Marking these errors would help in training the MTs, and the feedback from this data would help improve quality. I worked on a system that automatically evaluates the quality of MTs' work by faithfully labeling the errors they commit, using various machine learning algorithms. I also worked with a Six Sigma consultant to develop a quality indicator system that gives managers a quick overview of the quality of the medical transcribers. It is an ongoing project, with a vision of also detecting errors using the marked data.

Description

Brexit was looming, the value of the pound was down 20%, and good medical transcribers were hard to come by at our UK clinical documentation company. In this context, an ambitious project was born: complete AI-based automation of the entire medical transcription workflow. Medical transcription involved several stages: manual transcription of audio, proofreading, quality assurance, and submission to the client. I oversaw the speech recognition project, in which we developed our own models that reached 91% accuracy using state-of-the-art deep learning. I was given complete ownership of transforming the verbatim spoken text produced by speech recognition into fully formatted MS Word documents. Transforming raw text without any punctuation or structural signals into formatted documents was very challenging, but it was achieved using a combination of conventional machine learning, deep learning, and rule-based systems. I single-handedly designed and developed the complete infrastructure and software for this within six months, eliminating the manual processes except for minor touch-ups and verification. Previously, a document took from two days to one week to submit; we shortened that to a few minutes, giving a solid boost to the company's financials when it needed it most.
