
PCA algorithm calculator

6.5.14. Algorithms to calculate (build) PCA models

  1. Algorithms to calculate (build) PCA models. The different algorithms used to build a PCA model provide different insights into the model's structure and how to interpret it. These algorithms reflect how PCA has been used in different disciplines: PCA goes by different names in each area. 6.5.14.1. Eigenvalue decomposition
  2. The PCA method can be described and implemented using the tools of linear algebra. PCA is an operation applied to a dataset, represented by an n x m matrix A, that results in a projection of A which we will call B. Let's walk through the steps of this operation. For example, with a 3 x 2 matrix: $A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \\ a_{31} & a_{32} \end{pmatrix}$, $B = \mathrm{PCA}(A)$.
  3. Steps involved in PCA. Step 1: Standardize the dataset. Step 2: Calculate the covariance matrix for the features in the dataset. Step 3: Calculate the eigenvalues and eigenvectors of the covariance matrix (a NumPy sketch of these steps follows this list).
  4. Principal Components Analysis (PCA) is an algorithm to transform the columns of a dataset into a new set of features called Principal Components. By doing this, a large chunk of the information across the full dataset is effectively compressed into fewer feature columns. This enables dimensionality reduction and the ability to visualize the separation of classes.
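To make the steps above concrete, here is a minimal NumPy sketch of the eigendecomposition route (the function name pca_eig and all variable names are illustrative assumptions, not taken from the quoted sources):

```python
# Minimal sketch of eigendecomposition-based PCA with NumPy.
import numpy as np

def pca_eig(X, k):
    """Project the n x m data matrix X onto its top-k principal components."""
    X_centered = X - X.mean(axis=0)          # step 1: center each column
    C = np.cov(X_centered, rowvar=False)     # step 2: m x m covariance matrix
    eigvals, eigvecs = np.linalg.eigh(C)     # step 3: eigendecomposition
    order = np.argsort(eigvals)[::-1]        # sort eigenvalues descending
    W = eigvecs[:, order[:k]]                # top-k eigenvectors as columns
    return X_centered @ W                    # step 4: n x k projection B

A = np.random.rand(100, 5)
B = pca_eig(A, k=2)
print(B.shape)  # (100, 2)
```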

How to Calculate Principal Component Analysis (PCA) from Scratch in Python

by offline PCA. In the online setting, the vectors x_t are presented to the algorithm one by one. For every presented x_t the algorithm must output a vector y_t before receiving x_{t+1}. The quality of the result, however, is measured in exactly the same way: $\mathrm{ALG} = \min \lVert X - Y \rVert_F^2$. This paper presents the first approximation algorithms for this setting.

One common statement of the algorithm: subtract the mean from the initial data, calculate the covariance, calculate the eigenvalues and eigenvectors, and produce the transformed data (m x k). Several other references write the algorithm this way: prepare the initial data (m x n), calculate the mean, calculate the standard deviation, and compute the z-score = (initial data - mean) / standard deviation.

PCA, or Principal Component Analysis, is an unsupervised algorithm used for reducing the dimensionality of data while losing as little information as possible.

5. Computing the PCA. There are basically four steps to computing the principal component analysis algorithm: set up the data in a matrix, with each row being an object and each column a parameter value - there can be no missing data. Compute the covariance matrix from the data matrix.
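A small sketch of the z-score standardization step just described, under the assumption of a NumPy array named data of shape (m, n); all names are illustrative:

```python
# Sketch: z-score standardization followed by the covariance matrix.
import numpy as np

data = np.random.rand(50, 4)        # initial data (m x n)
mean = data.mean(axis=0)            # column means
std = data.std(axis=0, ddof=1)      # column standard deviations
z = (data - mean) / std             # z-score = (data - mean) / std
cov = np.cov(z, rowvar=False)       # covariance of the standardized data
```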

Principal Component Analysis, or PCA, is a dimensionality-reduction method that is often used to reduce the dimensionality of large data sets, by transforming a large set of variables into a smaller one that still contains most of the information in the large set. Reducing the number of variables of a data set naturally comes at the expense of accuracy.

The PCA calculations will be done following the steps given above. Let's first describe the input vectors, calculate the mean vector and the covariance matrix. Next, we get the eigenvalues and eigenvectors. We are going to reduce the data to two dimensions, so we form the transformation matrix A using the eigenvectors of the top two eigenvalues.

matplotlib.mlab.PCA() keeps all \(d\) dimensions of the input dataset after the transformation (stored in the class attribute PCA.Y), already ordered: since the PCA analysis orders the PC axes by descending importance in terms of describing the clustering, fracs is a list of monotonically decreasing values.

The PCA algorithm is based on mathematical concepts such as variance and covariance, and eigenvalues and eigenvectors. Some common terms used in the PCA algorithm: Dimensionality: the number of features or variables present in the given dataset - more simply, the number of columns in the dataset.
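The monotonically decreasing fracs values mentioned above are each eigenvalue divided by the sum of all eigenvalues. A sketch of the same computation in plain NumPy (illustrative names; this is not the mlab implementation):

```python
# Sketch: explained-variance fractions, analogous to mlab.PCA's `fracs`.
import numpy as np

X = np.random.rand(100, 5)
Xc = X - X.mean(axis=0)
eigvals = np.linalg.eigvalsh(np.cov(Xc, rowvar=False))[::-1]  # descending
fracs = eigvals / eigvals.sum()     # monotonically decreasing fractions
print(fracs, fracs.cumsum())
```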

PCA Algorithm. Calculate the covariance matrix X of the data points. Calculate the eigenvectors and corresponding eigenvalues. Sort the eigenvectors according to their eigenvalues in decreasing order. Choose the first k eigenvectors; these will be the new k dimensions. Transform the original n-dimensional data points into k dimensions.

Recently, Canovas and colleagues [3] reported on a new algorithm that incorporates PCA into the Tecnis toric IOL calculator. The authors tested the accuracy of the formula retrospectively on 274 eyes.

Principal Component Analysis, or PCA, is a widely used technique for dimensionality reduction of large data sets. Reducing the number of components or features costs some accuracy; on the other hand, it makes the large data set simpler and easier to explore and visualize. It also reduces the computational complexity of the model.

For example, if n_components=0.95, the algorithm will select the number of components while preserving 95% of the variability in the data. When applying PCA, all you need to do is create an instance of the PCA() class and fit it using the scaled values of X, then apply the transformation.

Purpose: To assess the accuracy of toric intraocular lens (IOL) power calculations of a new algorithm that incorporates the effect of posterior corneal astigmatism (PCA). Setting: Abbott Medical Optics, Inc., Groningen, the Netherlands. Design: Retrospective case report. Methods: In eyes implanted with toric IOLs, the exact vergence formula of the Tecnis toric calculator was used to predict refractive astigmatism.
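A brief sketch of the scikit-learn usage described in the first paragraph above, using a bundled toy dataset as a stand-in for X:

```python
# Sketch: keep enough components to preserve 95% of the variance.
from sklearn.datasets import load_iris
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA

X = load_iris().data
X_scaled = StandardScaler().fit_transform(X)

pca = PCA(n_components=0.95)          # fraction -> auto-select the count
X_reduced = pca.fit_transform(X_scaled)
print(pca.n_components_, pca.explained_variance_ratio_.sum())
```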

Understanding Principal Component Analysis (PCA) Step by Step

PCA Algorithm. Principal component analysis is a technique for feature extraction: it combines our input variables in a specific way, at which point we can drop the least important variables while still retaining the most valuable parts of all of the variables. PCA results in developing new features that are independent of one another.

A simple way of computing the PCA of a matrix X is to compute the eigenvalue decomposition of its covariance matrix. Given an N x D matrix X and a target dimensionality d, the algorithm computes an N x d matrix V such that the columns of V are the principal components of X.

Properties and limitations of PCA. Some properties of PCA include: Property 1: For any integer q, 1 <= q <= p, consider the orthogonal linear transformation $y = B'x$, where $y$ is a q-element vector and $B'$ is a (q x p) matrix, and let $\Sigma_y = B'\Sigma B$ be the variance-covariance matrix for $y$. Then the trace of $\Sigma_y$, denoted $\operatorname{tr}(\Sigma_y)$, is maximized by taking $B = A_q$, where $A_q$ consists of the first q columns of the matrix of eigenvectors of $\Sigma$.

The section after this discusses why PCA works, but providing a brief summary before jumping into the algorithm may be helpful for context: we are going to calculate a matrix that summarizes how our variables all relate to one another. We'll then break this matrix down into two separate components: direction and magnitude.

This is a small value. It indicates that the results of pca with the 'Rows','complete' name-value pair argument when there is no missing data and of pca with the 'algorithm','als' name-value pair argument when there is missing data are close to each other. Perform the principal component analysis using the 'Rows','complete' name-value pair argument and display the component coefficients.

PCA is not robust against outliers. Similar to the point above, the algorithm will be biased in datasets with strong outliers; this is why it is recommended to remove outliers before performing PCA. PCA assumes a linear relationship between features; the algorithm is not well suited to capturing non-linear relationships.

Principal Component Analysis (PCA) algorithm summary. So, to summarize the PCA algorithm: first perform mean normalization, to ensure that every feature has zero mean, and optionally feature scaling, which you should really do if the features are on very different ranges of values.

python - Calculating the percentage of variance measure

Analysis (PCA). PCA is a useful statistical technique that has found application in fields such as face recognition and image compression, and is a common technique for finding patterns in data of high dimension. Before getting to a description of PCA, this tutorial first introduces mathematical concepts that will be used in PCA.

In order to calculate the PCA, I then do the following: 1) Take the square root of the eigenvalues, giving the singular values. 2) Standardize the input matrix A as $(A - \mathrm{mean}(A)) / \mathrm{sd}(A)$. 3) Finally, to calculate the scores, multiply the standardized A by the matrix of eigenvectors.

According to Abbott, clinical studies show PCA can add an average of 0.3 D to total corneal astigmatism, so not accounting for PCA in toric lens calculations in patients with astigmatism may result in over-correction or under-correction. Calculations were performed using the same vergence formulas as those included in the toric calculator with and without considering the new PCA algorithm. A higher proportion of eyes fell within lower prediction errors when the new PCA algorithm was used.

How Principal Component Analysis (PCA) works. Whoever has tried to build machine learning models with many features will already have a glimpse of the concept of principal component analysis. In short, the inclusion of more features in a machine learning model can lead to worsening performance.
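As a sanity check on the square-root relationship in step 1, one can compare the singular values of a centered matrix with the eigenvalues of its covariance matrix. A sketch, assuming column-centered data and illustrative names:

```python
# Sketch: singular values of centered data vs. covariance eigenvalues.
# For an n x m centered matrix, sigma_i**2 == (n - 1) * lambda_i.
import numpy as np

X = np.random.rand(30, 4)
Xc = X - X.mean(axis=0)

sigma = np.linalg.svd(Xc, compute_uv=False)               # singular values
lam = np.linalg.eigvalsh(np.cov(Xc, rowvar=False))[::-1]  # eigenvalues, desc

print(np.allclose(sigma**2, (len(X) - 1) * lam))          # True
```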

Principal Component Analysis (PCA) - Better Explained ML

  1. Introduction. Principal Component Analysis (PCA) is a linear dimensionality reduction technique that can be utilized for extracting information from a high-dimensional space by projecting it into a lower-dimensional sub-space. It tries to preserve the essential parts of the data that have more variation and remove the non-essential parts with less variation.
  2. PCA (Principal Component Analysis) is a dimensionality reduction technique that was proposed by Pearson in 1901. It uses eigenvalues and eigenvectors to reduce dimensionality and project a training sample onto a smaller feature space. Let's look at the algorithm in more detail (from a face recognition perspective).
  3. Conclusion. Principal Component Analysis (PCA) is easier to perform in applications such as R, but there are also some pitfalls, as the R function prcomp does not scale the data values by default. With a little extra effort, PCA can be performed in Excel, but the greatest benefit in doing so is not the PCA itself, but the greater insight that hands-on calculation provides.
  4. Principal Component Analysis Tutorial. As you get ready to work on a PCA-based project, ready-to-use code snippets can be helpful. The main idea of principal component analysis (PCA) is to reduce the dimensionality of a data set consisting of many interrelated variables.
  5. Randomized PCA algorithms. In this case you can use the method pca.run() and skip other steps, like calculation of distances, variances, critical limits and so on. But in this case the data matrix must be centered (and scaled if necessary) manually prior to the decomposition. Here is an example using the data generated in the previous code.

STANDARDS & GUIDELINES. Patient-controlled analgesia (PCA) is a method of providing analgesia using a computerized pump that allows patients to self-administer predetermined doses of opioids. The delivery of small, frequent intravenous boluses of opioids results in reasonably constant serum concentrations of the opioid.

E.g. 100 x 100 images = 10,000 pixels. Such a huge feature vector will make the algorithm slow. With PCA we can reduce the dimensionality and make it tractable. How: 1) Extract the x's, so we now have an unlabeled training set. 2) Apply PCA to the x vectors, so we now have a reduced-dimensional feature vector z. 3) This gives you a new training set.

Principal Component Analysis On Matrix Using Python. 30/10/2020. Machine learning algorithms may take a lot of time working with large datasets. To overcome this, a new dimensionality reduction technique was introduced: if the input dimension is high, the Principal Component Algorithm can be used to speed up our machines.

PCA algorithm II (sample covariance matrix): given data $\{x_1, \ldots, x_m\}$, compute the covariance matrix $\Sigma = \frac{1}{m}\sum_{i=1}^{m}(x_i - \bar{x})(x_i - \bar{x})^T$, where $\bar{x} = \frac{1}{m}\sum_{i=1}^{m} x_i$. The PCA basis vectors are the eigenvectors of $\Sigma$; a larger eigenvalue means a more important eigenvector. We get the eigenvectors using an eigendecomposition. (Slide from Barnabas Poczos.)

In eyes implanted with toric IOLs, the exact vergence formula of the Tecnis toric calculator was used to predict refractive astigmatism from preoperative biometry, surgeon-estimated surgically induced astigmatism (SIA), and implanted IOL power, with and without including the new PCA algorithm.

Details. The NIPALS algorithm is well known in chemometrics. It is an algorithm for computing PCA scores and loadings. The advantage is that the components are computed one after the other, and one could stop at a desired number of components (a sketch follows below).

Principal Component Analysis (PCA) in Python using Scikit-Learn. Principal component analysis is a technique used to reduce the dimensionality of a data set. PCA is typically employed prior to implementing a machine learning algorithm because it minimizes the number of variables used to explain the maximum amount of variance for a given data set.

Indeed, the clustering analyses that follow some PCA calculations can be viewed as one way to assess a strong form of non-normality. Mathematically, circles do have principal axes, but they are just not uniquely determined: you can choose any orthogonal pair of radii as their principal axes. (whuber, Stats.StackExchange, Sep 16 '10)
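Here is a hedged sketch of NIPALS as the algorithm is commonly described in the chemometrics literature; this is a generic illustration, not the code behind the package quoted above (the data is centered inside the function):

```python
# Sketch of NIPALS: computes PCA scores (t) and loadings (p) one
# component at a time by deflating X.
import numpy as np

def nipals(X, n_components, tol=1e-8, max_iter=500):
    X = X - X.mean(axis=0)                   # center the columns
    scores, loadings = [], []
    for _ in range(n_components):
        t = X[:, 0].copy()                   # initial score vector
        for _ in range(max_iter):
            p = X.T @ t / (t @ t)            # loading estimate
            p /= np.linalg.norm(p)           # normalize the loading
            t_new = X @ p                    # updated score
            if np.linalg.norm(t_new - t) < tol:
                t = t_new
                break
            t = t_new
        X = X - np.outer(t, p)               # deflate: remove the component
        scores.append(t)
        loadings.append(p)
    return np.array(scores).T, np.array(loadings).T

T, P = nipals(np.random.rand(20, 6), n_components=2)
print(T.shape, P.shape)  # (20, 2) (6, 2)
```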

Principal Component Analysis (PCA) Algorithm - Stack Overflow

Here is an n = 2 dimensional example to perform a PCA without the use of the MATLAB function pca, but with the function eig for the calculation of eigenvectors and eigenvalues. Assume a data set that consists of measurements of p variables on n samples, stored in an n-by-p array. As an example we are creating a bivariate data set of two vectors.

In the PCA algorithm, how do we calculate the following? Write the equations if necessary and explain their components. (a) Average squared projection error? (b) Total variation in the data? (c) How can we use the measures in (a) and (b) to select the best k? Explain.

[https://github.com/minsuk-heo/python_tutorial/blob/master/data_science/pca/PCA.ipynb] explains PCA (principal component analysis) step by step and demonstrates it.

Strengths: PCA is a versatile technique that works well in practice. It's fast and simple to implement, which means you can easily test algorithms with and without PCA to compare performance. In addition, PCA offers several variations and extensions (i.e. kernel PCA, sparse PCA, etc.) to tackle specific roadblocks.
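For reference, the quantities in (a) and (b) are conventionally defined as follows (this is the standard course formulation, stated as an assumption rather than recovered from the quoted question):

```latex
\text{avg. squared projection error} = \frac{1}{n}\sum_{i=1}^{n}\bigl\lVert x^{(i)} - x^{(i)}_{\mathrm{approx}} \bigr\rVert^{2},
\qquad
\text{total variation} = \frac{1}{n}\sum_{i=1}^{n}\bigl\lVert x^{(i)} \bigr\rVert^{2}
```

For (c), choose the smallest k for which the ratio of the first quantity to the second is at most 0.01 (i.e. 99% of the variance is retained). Equivalently, with $[U, S, V] = \mathrm{svd}(\Sigma)$ of the covariance matrix, retain the smallest k with $\sum_{i=1}^{k} S_{ii} \,/\, \sum_{i=1}^{m} S_{ii} \ge 0.99$.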

Full lecture: http://bit.ly/PCA-alg. To find the eigenvectors, we first solve the determinant equation for the eigenvalues. We then solve for each eigenvector.

PCA(n_components=None, *, copy=True, whiten=False, svd_solver='auto', tol=0.0, iterated_power='auto', random_state=None) [source]. Principal component analysis (PCA). Linear dimensionality reduction using Singular Value Decomposition of the data to project it to a lower-dimensional space. The input data is centered but not scaled for each feature before applying the SVD.

The algorithm for Kernel PCA is similar to the one for Dual PCA except that in the case of Kernel PCA neither the training data nor the test data can be reconstructed. Algorithm 3 (recover basis): calculate $K = \Phi(X)^T\Phi(X)$ using the kernel, and let $V$ be the eigenvectors of $\Phi(X)^T\Phi(X)$ corresponding to the top d eigenvalues. Let $\Sigma$ be the diagonal matrix of the square roots of the top d eigenvalues.

Principal Component Analysis (PCA) is one of the most popular linear dimension reduction methods. Sometimes it is used alone, and sometimes as a starting solution for other dimension reduction methods. PCA is a projection-based method which transforms the data by projecting it onto a set of orthogonal axes. Let's develop an intuitive understanding of PCA.

Calculation of accuracy and elapsed time of the PSO and PCA algorithms: the accuracy of a classification is usually assessed by comparing the classification with some reference data that is believed to accurately reflect the true land cover [15].
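A generic kernel PCA sketch with an RBF kernel, in the spirit of the algorithm quoted above; the gamma parameter and all names are assumptions, and numerical care (e.g. discarding near-zero eigenvalues) is omitted:

```python
# Sketch of kernel PCA: build the RBF kernel matrix, center it in feature
# space, take the top-d eigenpairs, and project the training points.
import numpy as np

def kernel_pca(X, d, gamma=1.0):
    n = X.shape[0]
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    K = np.exp(-gamma * sq_dists)                       # RBF kernel matrix
    one_n = np.ones((n, n)) / n
    Kc = K - one_n @ K - K @ one_n + one_n @ K @ one_n  # center K
    eigvals, eigvecs = np.linalg.eigh(Kc)
    top = np.argsort(eigvals)[::-1][:d]                 # top-d eigenpairs
    alphas = eigvecs[:, top] / np.sqrt(eigvals[top])    # normalized coeffs
    return Kc @ alphas                                  # projected data

Z = kernel_pca(np.random.rand(40, 3), d=2)
print(Z.shape)  # (40, 2)
```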

The Amazon SageMaker PCA algorithm uses either of two modes to calculate these summaries, depending on the situation:

  1. regular: for datasets with sparse data and a moderate number of observations and features.
  2. randomized: for datasets with both a large number of observations and features. This mode uses an approximation algorithm.
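scikit-learn exposes a similar approximation through its svd_solver parameter; this sketch is the scikit-learn analogue, not SageMaker code:

```python
# Sketch: randomized SVD solver for large matrices, similar in spirit to
# SageMaker's "randomized" mode.
import numpy as np
from sklearn.decomposition import PCA

X = np.random.rand(5000, 300)
pca = PCA(n_components=10, svd_solver='randomized', random_state=0)
Z = pca.fit_transform(X)
print(Z.shape)  # (5000, 10)
```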

PCA works best on data sets having 3 or more dimensions, because with higher dimensions it becomes increasingly difficult to make interpretations from the resultant cloud of data. PCA is applied on a data set with numeric variables. PCA is a tool which helps to produce better visualizations of high-dimensional data. End note: covariance calculations are used to find relationships between dimensions in high-dimensional data sets (usually greater than 3) where visualization is difficult.

PCA: principal components analysis (PCA) is a technique that can be used to simplify a dataset. It is a linear transformation that chooses a new coordinate system for the data set.

PCA use cases: knowing about PCA and understanding how useful it can be, some machine learning engineers misunderstand its use cases and try to use it everywhere. A bad use: applying PCA (Principal Components Analysis) to prevent overfitting. PCA is not made for solving overfitting issues; regularization is the approach made just for that.

How does PCA work? Maths behind PCA - Analytics Vidhya

4.1 PCA with 95% variance explanation. Notice the code below has .95 for the number-of-components parameter. It means that scikit-learn chooses the minimum number of principal components such that 95% of the variance is retained:

pca = PCA(.95)
pca.fit(trainX_scaled)  # fit PCA on the training set only

You can find out how many components were selected from pca.n_components_.

A PCA (Principal Component Analysis) algorithm for face recognition is implemented by the following steps. First it takes sample images as input. Calculate the face vector for each input image. After finding the face vectors, find the average face vector of all the input images. Calculate the covariance matrix of all the input face vectors.

PCA is a dimension reduction method that takes datasets with a large number of features and reduces them to a few underlying features. PCA finds the underlying features in a given dataset by performing the following steps: 1. Calculate the covariance matrix of the features. 2. Calculate the eigenvectors and eigenvalues of the covariance matrix.

Assume a programmer wants to calculate the value of k in a PCA algorithm that retains 99% of the variance. Trying to do it efficiently, he or she calculates the singular value decomposition [U, S, V] = svd(Sigma), and the matrix S is found to be $S = \mathrm{diag}(4.9,\; 3.8,\; 0.9,\; 0.3,\; 0.1,\; 0.05)$. Calculate the value of k.

The power iteration algorithm starts with a vector $b_0$, which may be an approximation to the dominant eigenvector or a random vector. The method is described by the recurrence relation $b_{k+1} = \dfrac{A b_k}{\lVert A b_k \rVert}$. So, at every iteration, the vector $b_k$ is multiplied by the matrix $A$ and normalized. If we assume $A$ has an eigenvalue that is strictly greater in magnitude than its other eigenvalues and the starting vector $b_0$ has a nonzero component in the direction of the associated eigenvector, then $b_k$ converges to that eigenvector (a Python sketch follows below).
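A sketch of power iteration matching the recurrence above (generic textbook method; names are illustrative):

```python
# Sketch of power iteration for the dominant eigenvector.
import numpy as np

def power_iteration(A, num_iters=1000):
    b = np.random.rand(A.shape[1])       # random starting vector b_0
    for _ in range(num_iters):
        Ab = A @ b
        b = Ab / np.linalg.norm(Ab)      # b_{k+1} = A b_k / ||A b_k||
    return b

C = np.cov(np.random.rand(10, 50))       # a symmetric 10 x 10 test matrix
v1 = power_iteration(C)                  # approx. dominant eigenvector of C
```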

PCA: Principal Component Analysis - Baeldung on Computer Science

The PCA algorithm analyzes data coordinates in n dimensions and returns orthogonal coordinates representing the main deviation of the data. The main deviation of the data represents its main density; this information can be used to reduce the number of dimensions with minimal loss of information, as in lossy compression.

PCA, Clustering and Classification - UPGMA algorithm: assign each item to its own cluster; join the nearest clusters; re-estimate the distance between clusters; repeat for 1 to n. Hierarchical clustering: calculation of the distance between a test sample and the clusters.

One common problem when looking at financial data is the enormous number of dimensions we have to deal with. If we have enough computing power, we will be able to process so much data, but that will not always be the case. Sometimes, we need to reduce the dimensionality of the data. A lot of techniques are available and, in this post, we will explore one of them: Principal Component Analysis.

PCA-Based Algorithm for Generation of Crystal Lookup Tables for PET Block Detectors (J. Breuer and K. Wienhard, IEEE Transactions on Nuclear Science, 2009, vol. 56, pp. 602-607).

Details. Unlike the Possibilistic C-Means (PCM) algorithm, which requires the results of a previous run of Fuzzy C-Means (FCM) clustering in order to calculate the parameter Ω, the Possibilistic Clustering Algorithm (PCA) is based on the FCM objective function, the partition coefficient (PC) and partition entropy (PE) validity indexes, so PCA directly computes the typicality values and needs no prior FCM run.

Before implementing the PCA algorithm in Python, first download the wine data set. The source below contains the wine dataset file, so download it first to proceed with the code in Python. Source: Wine.csv. First of all, before processing algorithms, we have to import some libraries and read the file with the help of pandas.

In this Machine Learning from Scratch tutorial, we are going to implement a PCA algorithm using only built-in Python modules and numpy. We will also learn about the concept and the math behind this popular ML algorithm. All algorithms from this course can be found on GitHub together with example tests.

Fig. 1 (from Legendre & Legendre 1998) illustrates this algorithm on a very simple case with only two species (descriptors) and five samples. Fig. 2 illustrates the same logic on the data cloud in three-dimensional space (three species/descriptors). Figure 1: PCA ordination of five samples and two species (Fig. 9.2 from Legendre & Legendre 1998).

moments, which we calculate explicitly. The derived asymptotic results take simple forms and are reasonably accurate in regimes of practical interest. II. PRIVACY-PRESERVING PCA. The differentially private PCA algorithm of [1] consists of sampling from the matrix Bingham distribution. In their model, the algorithm takes as input a data matrix X.

A Step-by-Step Explanation of Principal Component Analysis

  1. Principal Component Analysis (PCA) in MATLAB. This is a demonstration of how one can use PCA to classify a 2D data set. This is the simplest form of PCA, but you can easily extend it to higher dimensions and you can do image classification with PCA. PCA consists of a number of steps: loading the data, then subtracting the mean of the data from each data point.
  2. Eliminating dimensions for data that is high-dimensional and hard to visualize.
  3. This article presents two optimized implementations of the PCA algorithm, primarily targeted on spectral image analysis in real time. One of them utilizes the SSE instruction set of contemporary CPUs, and the other one runs on graphics processors, using the CUDA environment. The implementations are evaluated and compared with a multithreaded C implementation compiled by an optimizing compiler.
  4. Objective. The objective of this study is to reproduce Hoffman's comparison between kPCA, linear PCA, Parzen density windows, and the OC-SVM on the 'Breast Cancer' and 'Digit 0' datasets. In addition I expand the comparison to four additional (two real, two generated) datasets. ROC curves are produced and AUCs are calculated for each dataset.
  5. Calculation of the eigenfaces using PCA will be presented. The algorithm described in scope of this paper is a variation of the one outlined here. A detailed (and more theoretical) description of PCA can be found in (Pissarenko, 2002, pp. 70-72).
  6. Kernel principal component analysis. In the field of multivariate statistics, kernel principal component analysis (kernel PCA) is an extension of principal component analysis (PCA) using techniques of kernel methods. Using a kernel, the originally linear operations of PCA are performed in a reproducing kernel Hilbert space
  7. It proceeds quite simply, and a reliable prediction is returned as the end product of logistic regression and PCA. Implementation of PCA with Python: scikit-learn can be used with Python to implement a working PCA algorithm, enabling Principal Component Analysis in Python as explained above.

Principal Component Analysis (PCA) Explained with Examples

Principal Components Analysis. Principal components analysis (PCA) is one of a family of techniques for taking high-dimensional data and representing it in a more tractable, lower-dimensional form; the eigendecomposition of the covariance matrix is the common algorithm for PCA. If the rows of the centered data matrix x are observations and the columns are variables, then $x^T x = n\,\mathbf{v}$, where $\mathbf{v}$ is the covariance matrix of the data.

A PCA algorithm accepts all of the incoming interest rate data as an input, and it processes the data so that as an output we get a certain set of interest rate tenor points which contribute around, say, 97% to 98% of the risk of our interest-rate-sensitive portfolio.

Performing Principal Component Analysis (PCA): we first find the mean vector Xm and the variation of the data (corresponding to the variance). We subtract the mean from the data values. We then apply the SVD. The singular values are 25, 6.0, 3.4, 1.9. The total variation is the sum of the squared singular values.
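Under the convention that the total variation equals the sum of the squared singular values, the quoted numbers work out as follows (this completion is an assumption based on the standard definition, not text recovered from the source):

```latex
T = \sum_i \sigma_i^2 = 25^2 + 6.0^2 + 3.4^2 + 1.9^2
  = 625 + 36 + 11.56 + 3.61 = 676.17,
\qquad
\frac{\sigma_1^2}{T} = \frac{625}{676.17} \approx 0.924
```

So the first component alone accounts for roughly 92% of the total variation.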

Principal component analysis (PCA) is routinely employed on a wide range of problems. From the detection of outliers to predictive modeling, PCA has the ability to project the observations described by variables into a few orthogonal components defined at where the data 'stretch' the most, rendering a simplified overview.

CUR algorithm on a data matrix: unlike PCA, we must provide the CUR algorithm the values of c and k. To improve the efficiency of the CUR algorithm with respect to PCA, we ran the function 'CUR' with input parameters c = k = 3, following the suggestion of Hunt explained before.

Introduction. Facial recognition (or face recognition) is a type of biometric software application that can identify a specific individual in a digital image by analyzing and comparing patterns.

Solving the PCA problem. The following figure shows the basic algorithm to compute a PCA, the interactive visual demo of which appears here. The dataset: some familiar faces. The dataset consists of a bunch of images of people's faces taken from the MIT Faces Recognition Project database. There are 3991 images in total.

The algorithm of PCA can be stated as: 1. Compute the mean u of the data X. 2. Subtract the mean from X. 3. Calculate the covariance matrix R. 4. Calculate the eigenvalues and eigenvectors of R, and choose the several eigenvectors with large eigenvalues. In our paper, we use $U = [u_1, u_2, u_3]$, which is enough for the first several frames.

Singular Value Decomposition, or SVD, is a computational method often employed to calculate principal components for a dataset. Using SVD to perform PCA is efficient and numerically robust. Moreover, the intimate relationship between them can guide our intuition about what PCA actually does and help us gain additional insights into this technique (a sketch follows below).

3.2 PCA-Based Algorithm. The whole recognition process involves two steps: A. Initialization process. B. Recognition process. The initialization process involves the following operations: i. Acquire the initial set of face images, called the training set. ii. Calculate the eigenfaces from the training set, keeping only those with the highest eigenvalues.

Principal Component Analysis (PCA). Principal Component Analysis (PCA) is one of the most popular linear dimension reduction algorithms. It is a projection-based method that transforms the data by projecting it onto a set of orthogonal (perpendicular) axes. PCA works on the condition that while the data in a higher-dimensional space is mapped to a lower-dimensional space, the variance of the data in the lower-dimensional space should be maximized.

Objectives: To develop and validate a risk calculator for prostate cancer (PCa) and clinically significant PCa (csPCa) using explainable artificial intelligence (XAI). Patients and methods: We used data from 3791 patients to develop and validate the risk calculator. We initially divided the data into development and validation sets. An extreme gradient-boosting algorithm was applied to the development set.
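A sketch of the SVD route to PCA described in the second paragraph above (illustrative names, not code from the quoted sources):

```python
# Sketch of SVD-based PCA: the right singular vectors of the centered data
# are the principal axes, and the scores follow by projection.
import numpy as np

def pca_svd(X, k):
    Xc = X - X.mean(axis=0)                    # center the data
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:k]                        # principal axes (loadings)
    scores = Xc @ components.T                 # equivalently U[:, :k] * S[:k]
    return scores, components

scores, components = pca_svd(np.random.rand(100, 8), k=3)
print(scores.shape, components.shape)  # (100, 3) (3, 8)
```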


4. Calculate PCA: next we calculate the PCA using the PCA class in C++ (see lines 19-23 in the main function above) and the PCACompute function in Python (see line 23 in the main function above). As an output of PCA, we obtain the mean vector and the 10 eigenvectors.

5. The PCA algorithm has proven better because of its improved PSNR: average PSNR by the PCA algorithm is 42, and average PSNR by the DCT algorithm is approximately 37.5. VII. Future scope: the work can be extended by developing a more efficient image compression technique, adding a GUI, and calculating some more metrics.

A Beginner's Guide to Eigenvectors, Eigenvalues, PCA, Covariance and Entropy. This post introduces eigenvectors and their relationship to matrices in plain language and without a great deal of math. It builds on those ideas to explain covariance, principal component analysis, and information entropy. The 'eigen' in eigenvector comes from German.

The objective of the proposed method is to explore the performance of hepatitis diagnosis using an algorithm that integrates PCA with Naive Bayes. The proposed method (PCA-NB) first uses PCA to reduce the dimension of the hepatitis dataset; the obtained reduced feature subset is then served as the input to the designed NB classifier.
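A sketch of a PCA-then-Naive-Bayes pipeline in the spirit of the PCA-NB method described above; this is a scikit-learn illustration, not the authors' code, and a bundled toy dataset stands in for the hepatitis data:

```python
# Sketch: PCA for dimensionality reduction feeding a Gaussian Naive Bayes
# classifier, evaluated with cross-validation.
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
model = make_pipeline(StandardScaler(), PCA(n_components=5), GaussianNB())
print(cross_val_score(model, X, y, cv=5).mean())
```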