PCA Python Application

Principle Component Analysis

Principle Component Analysis is a technique used to reduce the number of dimensions of your data. In the process you lose some accuracy but increase the simplicity of your data which can make it easier to use.

Steps
1. Scale your numerical variables
2. Center all of your points around the origin
3. Find the line of best fit that goes through the origin but also maximizes the distance from your points projected points on the line, and the origin.
4. This is your first Principle Component and the slope of this line will tell you how important each variable is
     - Create a vector along this line of length 1, this is the Eigenvector
     - The sum of squares of the Eigenvector is the Eigenvalue
     - The amounts each variable contribute to the slope of the Eigenvalue are the Loading Scores
5. Repeat this process creating as many Principle Components as there are variables which each Principle Component being perpendicular to the existing ones

Output Screenshots

1. Uploading file for PCA Analysis

2. Generated Graph from the uploaded file

graph File

Settings of Application

1. You can change variable parameters by changing

variable File

2. You can change PCA parameters by changing

Pca File

Facial Recognition using Deep Learning

Introduction
In today’s world, face recognition has become an integral part of our lives. We use face recognition feature for multi-purposes in our latest gadgets such as FaceID, etc. In this project, we will implement face recognition technique using deep learning concepts. Our aim is to recognize people from the given dataset such as photos or videos. The name of that person will be displayed on the photo or video being displayed. We will accomplish this through four important phases: face detection, posing and projecting faces, encoding (recognizing) faces through basic facial measurements and finding person’s name from encoding. We will use deep convolutional neural networks for encoding faces and also linear support vector machine (SVM) classifier for getting the information about the name of the person and displaying it. We also want to map a person’s face, even when the whole face (50%) is not displayed in the dataset. Thereby, predicting a face and identifying it. In this way, we can identify a person when he/she has worn a face mask, especially in current times of COVID-19 pandemic. We can use this feature in many gadgets and applications for authentication and medical purposes.

Problem Description
Face recognition involves many complex tasks that needs to be divided into stages. We are not just going to recognize faces from a given photo or a live video stream, we will also predict a person's face , when only half of it is given to the system as the data. And then, we will match the predicted face with our database, in order to recognize the face and identify it. Thus, there are four main stages by which we can achieve face recognition using deep learning. Face detection is the first basic stage wherein, we generate histogram of oriented gradients (HOG) representation of the image and then, we find the part of this image that matches or is similar to the HOG pattern extracted from a number of other training faces. In the next stage, i.e., posing and projecting faces – we need to detect faces from images having different poses of the same person. Hence, in this, we will use face landmark estimation algorithm. In this, we will train machine learning algorithm to find 68 specific points on any face, to solve this issue. The third stage, which is the most critical stage, is encoding faces. In this, we will use a number of measurements unique to a person’s face, which will help us identify a face. We use deep convolutional neural network (CNN) for this. Thus, we will be able to identify a person’s face. We will also do all these above tasks manually first, and then will train the machine later. We will also map the facial points of half of the face to the predicted facial points of the other half of the face. After the mapping is successfully done, we then use it to identify the person in the third stage. The last stage is the easiest stage, in which we need to find the name of the person/face that we identified from the database and display it. We use linear SVM model classifier for this. We need to run this classifier on our photo or video, to get the output, i.e, the name of the face of the person identified in the video or image. These combined tasks, enable us to recognize faces of people with high precision.

Outputs Facial Detected Images who wear mask
photos photos photos photos photos photos

Visit project site for detail information by clicking the icon below

House Market Prediction

Abstract
In this project, a machine learning model is proposed to predict the price of house based on the data related to house. This project can help buyer to predict the actual price of the house according to the area, facilities and utilities that include in the house. We are using some regression techniques to reach the goal of the project and including every single step and output to state the project progress. We are using Python programming language with some Python libraries like numpy, sklearn, seaborne, etc. To build machine learning models we need to clean data and able to predict the price of house based on house features in order to analyze and compare model performance to choose the best model that can give accurate result.

Introduction
Everyday thousands of houses are sold, and every buyer have questions like : What is the actual price of the house that I am gone buy? Am I paying a good price? Price of house is depending on size of the house, the year it was built in, location, etc. For the prediction we need to look some statistical data and try to figure out which model will give us an accurate result to predict price. For that we need to study some models and select one of the models for our project to get the perfect result to reach the goal of this project.
To, accomplish, we have pulled the dataset from the Kaggle

Tools Used
• Sklearn
• Numpy
• Matplotlib
• Pandas
• Jupyter Notebook
• Kaggle
• Seaborn

Common Features found
Which features are among the most important features for both XGBoost and Random Forest models and find out the difference in their importance regarding the two models:

Visit project report for detail information by clicking the icon below