i have simple dataframe:
id name state 1 john dc 1 john va 2 smith ne 3 janet ca 3 janet nc 3 janet md i want delete state value duplicate ids so:
id name state 1 john nan 1 john nan 2 smith ne 3 janet nan 3 janet nan 3 janet nan any idea how solve problem?
thanks,
duplicated returns boolean mask rows duplicated on columns defined in subset. keep=false indicates shouldn't consider first or last of duplicates non-duplicate. using loc allows assign rows duplicates happen.
df.loc[df.duplicated(subset=['id'], keep=false), 'state'] = none df 
Comments
Post a Comment