
Dimensionality and High Dimensional data in Machine Learning


Dimensionality

In machine learning and data science, dimensionality refers to the number of attributes (features) a dataset has. For example, a telecommunications dataset may have a large number of attributes (region, tenure, age, address, etc.). When such a dataset is stored in a CSV file, each column holds one attribute, so each column represents one dimension. This use of the term differs from the way dimensionality is used in mathematics or other sciences.
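As a rough illustration of counting dimensions from a CSV file, the sketch below reads a tiny in-memory table using only the Python standard library. The column names follow the telecom example above; the values and the variable names (`raw`, `n_observations`, `n_dimensions`) are made up for illustration.

```python
import csv
import io

# Hypothetical telecom data: column names come from the example in the text,
# the values are invented purely for illustration.
raw = """region,tenure,age,address
2,13,44,9
3,11,33,7
3,68,52,24
"""

reader = csv.reader(io.StringIO(raw))
header = next(reader)   # first row holds the attribute names
rows = list(reader)     # every remaining row is one observation

n_observations = len(rows)    # n: number of data points (rows)
n_dimensions = len(header)    # p: number of attributes (columns)

print(f"n = {n_observations}, p = {n_dimensions}")
```

With a real file, `io.StringIO(raw)` would be replaced by an opened file object; the counting logic stays the same.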

High Dimensional Data

High-dimensional data is a dataset in which the number of features exceeds the number of observations. Such a dataset has a very large number of attributes, which makes computation more complex. For example, suppose a dataset has n observations (data points) and p features (attributes). If n = 1000 and p = 2000, the data is high dimensional because p > n.

In simple words, no matter how big or small the dataset is, if the number of attributes is greater than the number of observations (p > n), the data is high dimensional, even when n and p are single-digit values. One of the most popular machine learning algorithms for working with high-dimensional data is the Support Vector Machine (SVM).
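Using the definition given earlier (more features than observations), the check can be sketched as a one-line comparison. The function name `is_high_dimensional` is a hypothetical helper introduced here, not part of any library.

```python
def is_high_dimensional(n: int, p: int) -> bool:
    """Return True when the features (p) outnumber the observations (n)."""
    return p > n


# The example from the text: 1000 observations, 2000 features.
print(is_high_dimensional(n=1000, p=2000))  # True

# Many observations but few features: not high dimensional by this definition.
print(is_high_dimensional(n=1000, p=20))    # False

# Single-digit sizes still count when p > n.
print(is_high_dimensional(n=5, p=8))        # True
```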

Dimensionality Reduction

In simple terms, dimensionality reduction means simplifying the data so that it is easier to understand, either numerically or visually. There are different methods for reducing the dimensionality of data, such as grouping similar data with tools like multidimensional scaling.
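To make multidimensional scaling a little more concrete, here is a minimal sketch of one variant, classical (Torgerson) MDS, written with NumPy. It assumes Euclidean distances between observations; the function name `classical_mds` and the random sample data are illustrative choices, and other MDS variants work differently.

```python
import numpy as np


def classical_mds(X, k=2):
    """Embed the rows of X into k dimensions with classical (Torgerson) MDS."""
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    # Squared Euclidean distances between every pair of observations.
    diff = X[:, None, :] - X[None, :, :]
    D2 = (diff ** 2).sum(axis=-1)
    # Double-centre the squared distances to obtain a Gram matrix.
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ D2 @ J
    # Keep the k largest eigenvalues and their eigenvectors.
    eigvals, eigvecs = np.linalg.eigh(B)
    order = np.argsort(eigvals)[::-1][:k]
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


rng = np.random.default_rng(0)
X = rng.normal(size=(6, 10))   # 6 observations, 10 features (p > n)
Y = classical_mds(X, k=2)      # same 6 observations, now 2 features each
print(Y.shape)  # (6, 2)
```

The two output columns can be drawn as a scatter plot, which is one way of simplifying the data visually.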






