group_by(CELLPHONE,MEMBERID) %>% filter(row_number() == 1) %>% ungroup()1234
方法二:
>>> data2 <- data7_0 %>%
distinct(CELLPHONE,MEMBERID, .keep_all = TRUE)12
方法三:
>>> data3 <- sqldf("select DISTINCT CELLPHONE,MEMBERID from data7_0")1
方法四:
>>> data4 <- base::unique(data7_0)1
方法五:
>>> data5 <- as.data.table(data7_0[!duplicated(data7_0$CELLPHONE), ])
建议你选择subset函数subset函数,从某一个数据框中选择出符合某条件的数据或是相关的列
(1)单条件查询
>selectresult=subset(df1,name=="aa")
>selectresult
name age sex
1 aa 20 f
>df1
name age sex
1 aa 20 f
2 bb 29 m
3 cc 30 f
(2)指定显示列
>selectresult=subset(df1,name=="aa",select=c(age,sex))
>selectresult
age sex
1 20 f
(3)多条件查询
>selectresult=subset(df1,name=="aa" &sex=="f",select=c(age,sex))
>selectresult
age sex
1 20 f
>df1
name age sex
1 aa 20 f
2 bb 29 m
3 cc 30 f