00:00.0
00:05.8
Welcome to excel champs in todays video we are going to learn about six powerful
00:05.9
00:10.7
methods to remove the duplicates from our data first of all we will use pivot tables
00:10.8
00:15.6
to identify the duplicates in our data set now here I have a small example
00:15.7
00:20.5
I have name of few students with the subject and the year of passing out
00:20.6
00:25.4
first of all I'll create a pivot in the same excel sheet I'll go to insert
00:25.5
00:30.3
00:30.3 insert a pivot table select existing worksheet and select a range here
00:30.4
00:35.2
00:35.2 and select the cell F22 and press ok we will put
00:35.3
00:40.1
the name in the rows than subject and than year
00:40.2
00:45.0
now I have put everything in the rows now this looks quite confusing
00:45.1
00:49.9
we'll go to desgin and change the report layout we'll go to show in the tabular form
00:50.0
00:54.8
now here I'll again go to design go to subtotals and
00:54.9
00:59.7
do not show subtotals now here I got the name of the students
00:59.8
01:04.6
the subjects and the year of passing out so these are my unique entries now lets
01:04.7
01:09.5
try this out I have a lsit of 13 students
01:09.6
01:14.4
and here I got the list of 10 students it means this data set contains
01:14.5
01:19.3
three duplicate values so this is how you can refine your data and get the
01:19.4
01:24.2
unique values using a pivot table option now second method is using
01:24.3
01:29.1
power query now first of all it is always advised to covert your
01:29.2
01:34.0
data into a data table how I'll do this I'll simply click anywhere
01:34.1
01:38.9
on my data and press CNRT+T from my keyboard and press enter
01:39.0
01:43.8
now this is changed into a table what I'll do is I'll click on the first cell go to the
01:43.9
01:48.7
data and click on from table range here my power query editor has
01:48.8
01:53.6
opened now there are two methods of removing the duplicates
01:53.7
01:58.5
first of all you can click on this table you will get a few options
01:58.6
02:03.4
and you can simply click on remove duplicates now here I got
02:03.5
02:08.3
my ten unique entries and my duplicate values which were three in count
02:08.4
02:13.2
have been removed now I'll again I'll remove this step again I got my
02:13.3
02:18.1
13 values now the 2nd way is you can select your data complete data set
02:18.2
02:23.0
go to home click on remove rows and remove duplicates
02:23.1
02:27.9
it has again removed my duplicates from my data set and I
02:28.0
02:32.8
got my unique values now I'll close close and load to table on
02:32.9
02:37.7
existing and I'll copy this here so I got my unique values
02:37.8
02:42.6
in the excel sheet now the third method is using the inbuilt function countif
02:42.7
02:47.5
or countifs now here we have only one single criteria so we will use countif
02:47.6
02:52.4
now to use countif first of all we will have to use a function called
02:52.5
02:57.3
textjoin as the name suggest this function will combine the given ranges
02:57.4
03:02.2
into one single cell first of all I'll use textjoin I'll define a delimiter
03:02.3
03:07.1
since I do not need a space so I'll leave this blank I'll put
03:07.2
03:12.0
true since I want to ignore my empty cells and than now I'll define
03:12.1
03:16.9
my range press enter now you see the magic range of converting this
03:17.0
03:21.8
data into a range I did not copy this formula till the end it was
03:21.9
03:26.7
automatically copied till the end now I'll use countif function in this range first of all
03:26.8
03:31.6
the array is range now my range is this column1 and what is my criteria
03:31.7
03:36.5
I just want to check if my this value is repeated in this given range
03:36.6
03:41.4
so I'll press enter again I got 1 this 1 means that
03:41.5
03:46.3
this entry is occuring once and when I select 2 this means that
03:46.4
03:51.2
this text is occuring twice now lets select the 2 these are my
03:51.3
03:56.1
repetative values now the next method is using conditional formatting now here
03:56.2
04:01.0
lets try this conditional formatting I am selecting my data go to home and conditional formatting I'll
04:01.1
04:05.9
select cell rules which are duplicates values now you will see it is going to
04:06.0
04:10.8
highlight my values which are repeted it has highlighted physics because it is
04:10.9
04:15.7
occuring twice or more than two times in my this column but this again does not make
04:15.8
04:20.6
sense beacuse I want the combination of these three cells which
04:20.7
04:25.5
combined are occuring twice so what I'll do is again I'll join my cells
04:25.6
04:30.4
using testjoin delimiter to than I'll select this range and press enter
04:30.5
04:35.3
so my cells are combined now we will use conditional formatting in this
04:35.4
04:40.2
combined cell I'll select my range conditional formatting highlight cells and duplicate values
04:40.3
04:45.1
you can always customize if you want to highlight these in red color
04:45.2
04:50.0
yellow green or you can also choose a custom color border or any kind of format
04:50.1
04:54.9
you like so here we will keep it default and press ok so these are the entries
04:55.0
04:59.8
which are occuring twice or more than two times in my data now the next
04:59.9
05:04.7
method is using the advance filters and go to data I have an option here
05:04.8
05:09.6
advanced which I belive most of you must not have used it
05:09.7
05:14.5
click on advcanced now here I have two options filter the list in place
05:14.6
05:19.4
or copy to another location we will try both the options in this video
05:19.5
05:24.3
first of all I am using filter the list in place and I am selecting this range
05:24.4
05:29.2
which is already selected you can see so no need to change it now I am just clicking on this
05:29.3
05:34.1
unique records only and press ok now you will notice my duplicate
05:34.2
05:39.0
entries are hidden it lies somewhwere between the row 32 and 36
05:39.1
05:43.9
only my unique values can be seen now I'll select this again press
05:44.0
05:48.8
Alt semicolumn cnrtl+c and paste here cntrl+v so
05:48.9
05:53.7
you will see that all my unique values are copied now lets select the second option
05:53.8
05:58.6
I am deleting this table I'll click on name go to data advanced now
05:58.7
06:03.5
I am using the option of copy to another location I'll click copy to another location choose my
06:03.6
06:08.4
range which is already selected copy to let me give this a new location
06:08.5
06:13.3
lets say its F22 unique records only and ok now this has copied only
06:13.4
06:18.2
the unique entries in my data set to my new given range now the last
06:18.3
06:23.1
option is the default option which most of us are using I'll select my range
06:23.2
06:28.0
and you have this option remove duplicates under data tools I'll click on
06:28.1
06:32.9
this this will ask for a confirmation that do you want to remove your duplicates
06:33.0
06:37.8
from all the columns or from a single coulmns since these columns are
06:37.9
06:42.7
interconnected with each other so I'll select all these columns and press ok
06:42.8
06:47.6
so it has removed my three duplicate values and it has found ten unique
06:47.7
06:52.5
names I hope you like this video please do not forget to like comment and share
06:52.6
06:53.9
thank you for watching guys