I would construct a new variable with values A1, A2, A2, B1, etc. but you could do something like this (from memory/untested):
sort cases by var 1 var2.
if( lag(var1)=var1 and lag(var2)=var2) dup = lag(dup)+1.
Sometimes lag() surprises me, but I think the above should work.
On 2/25/2019 4:56 AM, Matteo Ga wrote:
I have a dataset with dupliucated cases that could be identified by 2 variable.
Case -- var1 --- var2
1 -- A --- 1
2 -- A --- 2
3 -- A --- 2
4 -- B --- 1
5 -- B --- 2
I want to find (and then remove) any cases like 3
I searched online but I couldn't find any way how to do that.
Pspp-users mailing list
Alan D. Mead, Ph.D.
President, Talent Algorithms Inc.
science + technology = better workers
"You're an interesting species. An interesting mix.
You're capable of such beautiful dreams, and such
horrible nightmares. You feel so lost, so cut off,
so alone, only you're not. See, in all our
searching, the only thing we've found that makes
the emptiness bearable, is each other."
-- Carl Sagan, Contact