This repository contains a portfolio of data science projects I completed for academic, self-learning, and hobby purposes. They are presented as IPython notebooks and R Markdown files (published on RPubs).
For a more visually pleasing way to browse the portfolio, check out sajalsharma.com.
Note: Data used in the projects (found in the data directory) is for demonstration purposes only.
Tools: scikit-learn, Pandas, Seaborn, Matplotlib, Pygame
3-way Sentiment Analysis for Tweets: 3-way polarity (positive, negative, neutral) classification system for tweets, without using NLTK's sentiment analysis engine.
Cross language Information Retrieval: A cross-language information retrieval (CLIR) system which, given a query in German, searches text documents written in English.
Tools: NLTK, scikit-learn
Tools: Pandas, Folium, Seaborn and Matplotlib
I also dabble in all other kinds of technology. You can find a general portfolio here.
If you liked what you saw and want to chat with me about the portfolio, work opportunities, or collaboration, send an email to [email protected].
Most (if not all) of the project notebooks currently use Python 2. I'm in the process of updating them to Python 3, with completion estimated for Q1 2022. Contributions for these version updates are welcome. :)
If this project inspired you, gave you ideas for your own portfolio, or helped you, please consider buying me a coffee.