如何刪除SQLServer表中重復(fù)記錄

小魚兒363 2016-04-03

展開全文

如何刪除SQLServer表中重復(fù)記錄

　一張表里面以兩個字段為唯一字段，當(dāng)幾條記錄的這兩個字段完全相同時,即出現(xiàn)重復(fù)記錄，需要刪除重復(fù)項，如下面表：
a b c d
1 2 3 4
1 5 3 5
1 2 7 9

create table tb(a int,b int,c int,d int)

insert into tb

select 1,2,3,4 union all

select 1,5,3,5 union all

select 1,2,7,9

　以a、b為唯一字段，第一條和第三條的a、b完全相同，所以，需要刪除第一條記錄1 2 3 4 或者第三條記錄1 2 7 9
即如下結(jié)果：
a b c d
1 2 3 4
1 5 3 5
或
a b c d
1 5 3 5
1 2 7 9

　 SQL語句如下

CREATE TABLE Tb1(id int, [a] varchar(255), [b] varchar(255), [c] varchar(255), [d] varchar(255))
INSERT Tb1(id, [a], [b], [c], [d])
　　　　　 SELECT 1, '1','2','3','4'
UNION ALL SELECT 2, '1','5','3','5'
UNION ALL SELECT 3, '1','2','7','9'
UNION ALL SELECT 4, '1','4','7','6'

del ete Tb1 where [id] not in (sel ect max([id]) fro m Tb1 group by a,b )
sel ect * fro m tb1

drop table tb1

如果要同時刪除第一和第三行
即如下結(jié)果：
a b c d
1 5 3 5

語句如下：

delete * from tb t

inner join

(

select a ,b

from tb

group by a , b

having count(*)>1

on t.a = n.a and t.b = n.b

或

delete tb from tb m,

(

select a ,b

from tb

group by a , b

having count(*)>1

) n

where m.a = n.a and m.b = n.b

　　在幾千條記錄里,存在著些相同的記錄,如何能用SQL語句,刪除掉重復(fù)的呢?
1、查找表中多余的重復(fù)記錄，重復(fù)記錄是根據(jù)單個字段（peopleId）來判斷
sel ect * fro m people
where peopleId in (sel ect peopleId fro m people group by peopleId having count(peopleId) > 1)

2、刪除表中多余的重復(fù)記錄，重復(fù)記錄是根據(jù)單個字段（peopleId）來判斷，只留有rowid最小的記錄
del ete fro m people
where peopleId in (sel ect peopleId fro m people group by peopleId　 having count(peopleId) > 1)
and rowid not in (sel ect min(rowid) fro m people group by peopleId having count(peopleId )>1)

3、查找表中多余的重復(fù)記錄（多個字段）
sel ect * fro m vitae a
where (a.peopleId,a.seq) in (sel ect peopleId,seq fro m vitae group by peopleId,seq having count(*) > 1)

4、刪除表中多余的重復(fù)記錄（多個字段），只留有rowid最小的記錄
del ete fro m vitae a
where (a.peopleId,a.seq) in (sel ect peopleId,seq fro m vitae group by peopleId,seq having count(*) > 1)
and rowid not in (sel ect min(rowid) fro m vitae group by peopleId,seq having count(*)>1)

5、查找表中多余的重復(fù)記錄（多個字段），不包含rowid最小的記錄
sel ect * fro m vitae a
where (a.peopleId,a.seq) in (sel ect peopleId,seq fro m vitae group by peopleId,seq having count(*) > 1)
and rowid not in (sel ect min(rowid) fro m vitae group by peopleId,seq having count(*)>1)

比方說在A表中存在一個字段“name”，而且不同記錄之間的“name”值有可能會相同，
現(xiàn)在就是需要查詢出在該表中的各記錄之間，“name”值存在重復(fù)的項；
Select Name,Count(*) From A Group By Name Having Count(*) > 1

如果還查性別也相同大則如下:
Select Name,--,Count(*) From A Group By Name,-- Having Count(*) > 1
------------------------------------------------------------------------------------------------
declare @max integer,@id integer
declare cur_rows cursor local for sel ect 主字段,count(*) fro m 表名 group by 主字段 having count(*) >； 1
open cur_rows
fetch cur_rows into @id,@max
while @@fetch_status=0
begin
sel ect @max = @max -1
set rowcount @max
del ete fro m 表名 where 主字段 = @id
fetch cur_rows into @id,@max
end
close cur_rows
set rowcount 0

方法二
　　有兩個意義上的重復(fù)記錄，一是完全重復(fù)的記錄，也即所有字段均重復(fù)的記錄，二是部分關(guān)鍵字段重復(fù)的記錄，比如Name字段重復(fù)，而其他字段不一定重復(fù)或都重復(fù)可以忽略。
　　1、對于第一種重復(fù)，比較容易解決，使用
sel ect distinct * fro m tableName
　　就可以得到無重復(fù)記錄的結(jié)果集。
　　如果該表需要刪除重復(fù)的記錄（重復(fù)記錄保留1條），可以按以下方法刪除
sel ect distinct * into #Tmp fro m tableName
drop table tableName
sel ect * into tableName fro m #Tmp
drop table #Tmp
　　發(fā)生這種重復(fù)的原因是表設(shè)計不周產(chǎn)生的，增加唯一索引列即可解決。

　　2、這類重復(fù)問題通常要求保留重復(fù)記錄中的第一條記錄，操作方法如下
　　假設(shè)有重復(fù)的字段為Name,Address，要求得到這兩個字段唯一的結(jié)果集
sel ect identity(int,1,1) as autoID, * into #Tmp fro m tableName
sel ect min(autoID) as autoID into #Tmp2 fro m #Tmp group by Name,autoID
sel ect * fro m #Tmp where autoID in(sel ect autoID fro m #tmp2)
　　最后一個sel ect即得到了Name，Address不重復(fù)的結(jié)果集（但多了一個autoID字段，實際寫時可以寫在sel ect子句中省去此列）
sel ect * fro m tablename where id in (
sel ect id fro m tablename
group by id
having count(id) > 1)