ππ«I have enlisted a few academic projects undertaken by me as a student at USC
π»ππProjects
ππ§βππ©βπ During my Master of Science in Computer Science (Data Science) at USC, I had the opportunity to work on a variety of academic projects that challenged me and expanded my knowledge in the field. π€π»π These projects allowed me to apply the concepts I learned in the classroom to real-world situations, and collaborate with classmates from diverse backgrounds. πͺπ€ The program's focus on both theoretical and practical aspects of data science equipped me with a well-rounded skill set. π―π I am grateful for the hands-on experience and the chance to work on cutting-edge technologies, and I am confident that the education I received at USC has set me up for success in my future endeavors. ππΌ
Kernel Programming on Weenix OSπ§βπ»π§
The CSCI 402 Operating Systems course at USC included a series of challenging kernel assignments which involved the implementation of various parts of a Weenix-based operating system. π§βπ»π‘
These assignments provided hands-on experience in operating system design and implementation. π€π» The assignments covered topics such as processes and threads, VFS layer and virtual memory and were implemented using C programming language and various debugging tools.
π οΈπ¨βπ» These assignments helped students to gain valuable knowledge in system calls, process management, memory management, and file systems, as well as develop problem-solving skills and an understanding of the inner workings of an operating system. ππ‘ This experience was a great addition to students portfolio and would be beneficial for their future endeavors. ππ
Emulation Based Distributed File SystemπΎπ
The EDFS (Emulation Based Distributed File System) project was π¨ developed and tested using a large crime database π΅οΈββοΈ from Los Angeles, containing over 5 million records π and 50 GB of data πΎ. The goal of the project was to improve the efficiency of data retrieval β‘.
Python libraries π such as pymysql, base64, streamlit, and numpy were utilized to implement the EDFS system. Streamlit was chosen as the front-end framework for its simplicity and ease of use compared to other frameworks like Django π. As a result, the project achieved a 50% increase in data retrieval speed π. The use of Streamlit allowed for an intuitive and user-friendly interface π€, making it easier for non-technical users π₯ to access the data stored in the EDFS.
This project showcases my skills in working with large datasets π, implementing complex algorithms π€, and developing user-friendly interfaces π§βπ». It also demonstrates my ability to utilize the latest tools and technologies to achieve efficient and effective results π.
Kaggle Project for CSCI 567: Machine Learningπ€ππ¨βπ»
π€π As part of the CSCI 567 Machine Learning course at USC, a Kaggle competition was conducted with the objective of forecasting store sales using time series data. πͺπ° The dataset, which was obtained from this link "https://www.kaggle.com/competitions/store-sales-time-series-forecasting", involved challenging time series data pertaining to store sales.
The competition involved competing with other class teams to achieve the best public score on Kaggle. π₯π Various machine learning algorithms were applied such as decision trees, and regression techniques like Ridge, Random Forest, Linear and XGBRegressor to develop the model.
π§π In the final model, the best outputs were stacked together from different algorithms, which resulted in high accuracy and a final public score of 0.38212.
π―π― This score placed the team among the top 10 in the class, and first for the mid-term submission of the same when the public score was 0.39101. ππ