How to identify similar observations in another group without replacement

Todd Jones

Join Date: Oct 2020

Posts: 28
#1

How to identify similar observations in another group without replacement

19 Nov 2024, 14:37

Say that I have a dataset where some observations have x==0 and the others have x==1. (If it is easier, we can consider these as two distinct datasets.) There are several binary variables, b1, b2, b3, and b4, and two continuous variables, c1, and c2. What is a command that I can use to identify one similar x==0 observation for each x==1 observation (and do so without replacement)? I am flexible with how I define "similar," but it will be based on how close the six binary and continuous variables are. If necessary, I can consider only one of the two continuous variables (c1) and commit to prioritizing similarity in c1 over similarity in the binary variables.

Last edited by Todd Jones; 19 Nov 2024, 14:39.
Tags: None
Rich Goldstein

Join Date: Mar 2014

Posts: 4387
#2

19 Nov 2024, 14:50

there are several user-written routines that would probably be helpful; use -search- to find and download: -calipmatch-, kmatch-, -ultimatch-
1 like
Comment
Todd Jones

Join Date: Oct 2020

Posts: 28
#3

19 Nov 2024, 15:31

Great, thanks!
Comment

Announcement