Smita P.

Data Science Analyst

Pune , India

Experience: 10 Years

Smita

Pune , India

Data Science Analyst

37020 USD / Year

Immediate: Available

10 Years

Now you can Instantly Chat with Smita!

Chat Now

About Me

Professional Data Science analyst with rich experience in Team Leading and Advisory Consulting roles. Quick to grasp new ideas, develop innovative solutions, and work well independently to meet deadlines....

Skills

Positions

Data Analysts

Data Scientist

Business Analysts

Data Engineer

Technology Analyst

Customer Insights Analyst

Portfolio Projects

Description

1. Worked on data gathering, data cleaning, data modelling,

2. Worked on explorative data analysis

3. Outlier and missing value detection and tratement

4.Data Imbalance

5. Features Selection

6. Building Classification Model

7. Evaluating Accuracy and other metrics

8. Parameter Tuning

9. Finalize the Model5.

Show More Show Less

Description

1. Applied the hierarchical clustering and plotted the dendrogram.

2. Identified the 3 cluster- High spending, medium spending and Low spending - with help of dendrogram

3. compared the results with k-means clustering.

4. By making use of this data, company can announce various offers to various segments

5. Appliedhierarchical clustering , Agglomerative Clustering, Kmeans clustering, Python, Pnadas, numpy

Show More Show Less

Description

Wrote data pre-processing scripts

Normalization: is to clean data to obtain better features
converting all letters to lower or upper case
removing numbers
removing punctuations
removing stop words, sparse terms, and particular / rare words
spelling correction
stemming: removal of suffices,
Lemmatization: converts the word into its root word, rather than just stripping the suffices.

2. Worked on Feature extraction by Tf-idf, CountVectorizer, Singular Vector Decomposition (SVD) and Feature Engineering
3. This is developed using Python, NLP and Machine Learning like Naïve Bays, XGboost.

Show More Show Less

Description

Task was to predict attrition of the employees given the behavioural data and related historic attrition. Decision Tree, Random Forest

Show More Show Less

Description

A main area offocuse is machine learning model that can identify toxicity in on line conversation where toxicity is define as some thing like disrespectful statment. This is NLP project

Show More Show Less

Description

Gennerating a meaningful summery from the different type fo text

Show More Show Less

http//www.linkedin.com/in/smita-paul-37916b179

Description

Built Logistic Regression and Decision Tree models to identify customers likely to default based on demographic and financial data. Evaluated performance metrics like AUC, F1 score, Recall, Precision, and Accuracy.

Show More Show Less

Description

Identified authors of sentences in a dataset using Nave Bayes, text analysis, and feature extraction techniques like Tf-idf, CountVectorizer, and SVD.

Show More Show Less

Description

Applied hierarchical clustering to identify customer segments based on spending behavior. Used dendrogram and k-means clustering to analyze and offer targeted promotions.

Show More Show Less