How to Delete Duplicate Records from a SQL Server Table
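A table uses two columns together as its unique key. When several rows have identical values in both columns, they are duplicates, and the extra rows need to be deleted. Take the following table:

    a  b  c  d
    1  2  3  4
    1  5  3  5
    1  2  7  9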
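    create table tb(a int, b int, c int, d int)
    insert into tb
    select 1,2,3,4
    union all select 1,5,3,5
    union all select 1,2,7,9

With a and b as the unique key, the first and third rows have identical a and b values, so either the first row (1 2 3 4) or the third row (1 2 7 9) has to be deleted. The result is one of the following:

    a  b  c  d
    1  2  3  4
    1  5  3  5

or

    a  b  c  d
    1  5  3  5
    1  2  7  9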
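The SQL is as follows. This version of the table adds an id column to tell the rows apart, plus a fourth sample row: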
    CREATE TABLE Tb1(id int, [a] varchar(255), [b] varchar(255), [c] varchar(255), [d] varchar(255))
    INSERT Tb1(id, [a], [b], [c], [d])
    SELECT 1, '1','2','3','4'
    UNION ALL SELECT 2, '1','5','3','5'
    UNION ALL SELECT 3, '1','2','7','9'
    UNION ALL SELECT 4, '1','4','7','6'
    -- keep only the row with the largest id in each (a, b) group
    delete Tb1 where [id] not in (select max([id]) from Tb1 group by a, b)
    select * from Tb1
    drop table Tb1
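On SQL Server 2005 and later, the same keep-one-row-per-(a, b) result can also be sketched with ROW_NUMBER() over a common table expression. This is an alternative to the NOT IN statement above, not the original author's method; the temporary table #Tb1Demo and the CTE name ranked are made up for illustration.

```sql
-- Hypothetical demo table mirroring Tb1 (the name #Tb1Demo is an assumption).
CREATE TABLE #Tb1Demo (id int, a varchar(255), b varchar(255), c varchar(255), d varchar(255));
INSERT INTO #Tb1Demo (id, a, b, c, d)
SELECT 1, '1', '2', '3', '4' UNION ALL
SELECT 2, '1', '5', '3', '5' UNION ALL
SELECT 3, '1', '2', '7', '9' UNION ALL
SELECT 4, '1', '4', '7', '6';

-- Number the rows inside each (a, b) group, highest id first.
WITH ranked AS (
    SELECT id,
           ROW_NUMBER() OVER (PARTITION BY a, b ORDER BY id DESC) AS rn
    FROM #Tb1Demo
)
-- Deleting through the CTE deletes the underlying rows; rn = 1 survives.
DELETE FROM ranked WHERE rn > 1;

SELECT * FROM #Tb1Demo ORDER BY id;  -- rows with id 2, 3 and 4 remain
DROP TABLE #Tb1Demo;
```

Changing ORDER BY id DESC to ORDER BY id would keep the row with the smallest id in each group instead.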
To delete both the first and the third row instead, leaving only:

    a  b  c  d
    1  5  3  5

use one of the following statements:
    delete t
    from tb t
    inner join (
        select a, b from tb group by a, b having count(*) > 1
    ) n on t.a = n.a and t.b = n.b

or

    delete m
    from tb m, (
        select a, b from tb group by a, b having count(*) > 1
    ) n
    where m.a = n.a and m.b = n.b
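Because this variant removes every row of a duplicated group, it may be worth previewing the outcome before committing. The following is a minimal sketch under assumptions: the temporary table #tbDemo is hypothetical, and a correlated COUNT(*) stands in for the joins above.

```sql
-- Hypothetical demo table mirroring tb (the name #tbDemo is an assumption).
CREATE TABLE #tbDemo (a int, b int, c int, d int);
INSERT INTO #tbDemo (a, b, c, d)
SELECT 1, 2, 3, 4 UNION ALL
SELECT 1, 5, 3, 5 UNION ALL
SELECT 1, 2, 7, 9;

BEGIN TRANSACTION;

-- Delete every row whose (a, b) pair occurs more than once.
DELETE t
FROM #tbDemo AS t
WHERE (SELECT COUNT(*)
       FROM #tbDemo AS o
       WHERE o.a = t.a AND o.b = t.b) > 1;

-- Inspect what is left while the transaction is still open.
SELECT * FROM #tbDemo;  -- only the row 1, 5, 3, 5 remains

-- Undo the delete; use COMMIT TRANSACTION instead once the result looks right.
ROLLBACK TRANSACTION;

DROP TABLE #tbDemo;
```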
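Among several thousand records there are some identical ones. How can the duplicates be deleted with SQL statements?

1. Find the duplicate records in a table, where duplicates are determined by a single column (peopleId):

    select * from people
    where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)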
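2. Delete the duplicate records in a table, where duplicates are determined by a single column (peopleId), keeping only the record with the smallest rowid. Note that rowid is an Oracle pseudocolumn; SQL Server has no direct equivalent, so this statement is Oracle syntax:

    delete from people
    where peopleId in (select peopleId from people group by peopleId having count(peopleId) > 1)
    and rowid not in (select min(rowid) from people group by peopleId having count(peopleId) > 1)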
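3. Find the duplicate records in a table, where duplicates are determined by several columns. The multi-column (x, y) in (...) comparison is accepted by Oracle but not by SQL Server:

    select * from vitae a
    where (a.peopleId, a.seq) in (select peopleId, seq from vitae group by peopleId, seq having count(*) > 1)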
4. Delete the duplicate records in a table (several columns), keeping only the record with the smallest rowid:

    delete from vitae a
    where (a.peopleId, a.seq) in (select peopleId, seq from vitae group by peopleId, seq having count(*) > 1)
    and rowid not in (select min(rowid) from vitae group by peopleId, seq having count(*) > 1)
5. Find the duplicate records in a table (several columns), excluding the record with the smallest rowid:

    select * from vitae a
    where (a.peopleId, a.seq) in (select peopleId, seq from vitae group by peopleId, seq having count(*) > 1)
    and rowid not in (select min(rowid) from vitae group by peopleId, seq having count(*) > 1)
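The vitae statements above rely on rowid and on multi-column IN, neither of which SQL Server supports. A rough SQL Server rendering of items 3 and 4 might look like the sketch below. The temporary table #vitaeDemo, its note column, and the sample rows are invented for illustration. Since there is no rowid to order by, which row of each group survives is arbitrary here.

```sql
-- Hypothetical demo table (name, note column and data are assumptions).
CREATE TABLE #vitaeDemo (peopleId int, seq int, note varchar(50));
INSERT INTO #vitaeDemo (peopleId, seq, note)
SELECT 1, 1, 'first'  UNION ALL
SELECT 1, 1, 'second' UNION ALL
SELECT 1, 2, 'unique' UNION ALL
SELECT 2, 1, 'unique';

-- Item 3: list every row whose (peopleId, seq) pair is duplicated,
-- using a join to the grouped pairs instead of a multi-column IN.
SELECT v.*
FROM #vitaeDemo AS v
INNER JOIN (
    SELECT peopleId, seq
    FROM #vitaeDemo
    GROUP BY peopleId, seq
    HAVING COUNT(*) > 1
) AS d ON v.peopleId = d.peopleId AND v.seq = d.seq;

-- Item 4: keep one row per (peopleId, seq) and delete the rest.
-- ORDER BY (SELECT NULL) means no particular order, so the survivor is arbitrary.
WITH ranked AS (
    SELECT ROW_NUMBER() OVER (PARTITION BY peopleId, seq ORDER BY (SELECT NULL)) AS rn
    FROM #vitaeDemo
)
DELETE FROM ranked WHERE rn > 1;

SELECT * FROM #vitaeDemo;  -- three rows remain, one per (peopleId, seq) pair
DROP TABLE #vitaeDemo;
```

If the table has a column that defines which row is "first" (an identity or a timestamp, for example), putting it in the ORDER BY makes the choice deterministic.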
Suppose table A has a column "name", and different records may share the same "name" value. The task is to find the "name" values that occur in more than one record:

    Select Name, Count(*) From A Group By Name Having Count(*) > 1
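To also require that the gender matches, use the following (the gender column name is missing from the source; Gender is a placeholder):

    Select Name, Gender, Count(*) From A Group By Name, Gender Having Count(*) > 1

------------------------------------------------------------------------------------------------

The following cursor walks through each duplicated key value and deletes all but one of its rows (table_name and key_column are placeholders):

    declare @max integer, @id integer
    declare cur_rows cursor local for
        select key_column, count(*) from table_name group by key_column having count(*) > 1
    open cur_rows
    fetch cur_rows into @id, @max
    while @@fetch_status = 0
    begin
        -- limit the next delete to (count - 1) rows so that one row survives
        select @max = @max - 1
        set rowcount @max
        delete from table_name where key_column = @id
        fetch cur_rows into @id, @max
    end
    close cur_rows
    deallocate cur_rows
    -- restore the default of no row limit
    set rowcount 0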
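Microsoft's documentation flags the effect of SET ROWCOUNT on DELETE, INSERT and UPDATE for removal in a future release and recommends TOP instead. As a hedged sketch, the body of the loop above could be expressed with DELETE TOP for a single key value. The temporary table #rowsDemo, its payload column, and the variable @extra are hypothetical.

```sql
-- Hypothetical demo table (names and data are assumptions).
CREATE TABLE #rowsDemo (key_column int, payload varchar(20));
INSERT INTO #rowsDemo (key_column, payload)
SELECT 1, 'x' UNION ALL
SELECT 1, 'x' UNION ALL
SELECT 1, 'x' UNION ALL
SELECT 2, 'y';

DECLARE @id int, @extra int;
SET @id = 1;  -- the duplicated key value to clean up

-- Number of surplus rows for this key: everything beyond the first.
SELECT @extra = COUNT(*) - 1 FROM #rowsDemo WHERE key_column = @id;

-- DELETE TOP (n) removes at most n matching rows, leaving one behind.
IF @extra > 0
    DELETE TOP (@extra) FROM #rowsDemo WHERE key_column = @id;

SELECT key_column, COUNT(*) AS cnt
FROM #rowsDemo
GROUP BY key_column;  -- each key now has cnt = 1

DROP TABLE #rowsDemo;
```

As with SET ROWCOUNT, which of the matching rows get removed is not defined, so this only suits rows that are interchangeable.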
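Method 2

There are two kinds of duplicate records. The first kind is fully duplicated records, where every column is identical. The second kind is records duplicated on some key columns only, for example the Name column repeats while the other columns may or may not repeat, or their repetition can be ignored.

1. The first kind is fairly easy to handle. The query

    select distinct * from tableName

returns a result set without duplicate records. If the duplicates need to be deleted from the table itself (keeping one copy of each), proceed as follows:

    select distinct * into #Tmp from tableName
    drop table tableName
    select * into tableName from #Tmp
    drop table #Tmp

This kind of duplication comes from careless table design; adding a uniquely indexed column prevents it.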
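One way to read "adding a uniquely indexed column" is an identity column with a unique index on it, so that every row becomes individually addressable. The sketch below assumes that reading. The temporary table #tableNameDemo, its columns, and the index names are hypothetical, and the second index assumes that Name and Address together are meant to be unique. GO is the batch separator used by SSMS and sqlcmd, not a T-SQL statement.

```sql
-- Hypothetical table that has already been de-duplicated (names are assumptions).
CREATE TABLE #tableNameDemo (Name varchar(50), Address varchar(100));
INSERT INTO #tableNameDemo (Name, Address)
SELECT 'Ann', 'Street 1' UNION ALL
SELECT 'Bob', 'Street 2';
GO

-- Add an auto-numbered column; existing rows receive values automatically.
ALTER TABLE #tableNameDemo ADD autoID int IDENTITY(1,1) NOT NULL;
GO

-- The unique index guarantees that autoID identifies exactly one row.
CREATE UNIQUE INDEX UX_tableNameDemo_autoID ON #tableNameDemo (autoID);

-- Optional, assuming (Name, Address) should never repeat:
-- later inserts of an existing pair will then fail instead of creating a duplicate.
CREATE UNIQUE INDEX UX_tableNameDemo_Name_Address ON #tableNameDemo (Name, Address);
GO

DROP TABLE #tableNameDemo;
```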
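2. For the second kind, the usual requirement is to keep the first record of each group of duplicates. Suppose the duplicated columns are Name and Address, and a result set unique on these two columns is wanted:

    select identity(int,1,1) as autoID, * into #Tmp from tableName
    select min(autoID) as autoID into #Tmp2 from #Tmp group by Name, Address
    select * from #Tmp where autoID in (select autoID from #Tmp2)

The last select returns the result set in which Name and Address are not duplicated. It carries an extra autoID column; in practice, list the wanted columns explicitly in the select clause to leave it out.

To list the rows whose id value occurs more than once:

    select * from tablename
    where id in (select id from tablename group by id having count(id) > 1)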