1 comment

  • Major_Grooves 5 hours ago

    I'm the founder of Tilores, the entity-resolution tool used here - so full disclosure, this is my company's product. This wasn't a paid engagement or a case study. It started because my wife is from Venezuela, and she saw people on social media pointing out that the missing-persons lists had huge numbers of duplicates.

    On the data: these are public, citizen-led efforts to crowd-source the names of the missing, hosted on websites and spreadsheets. There is no official verification process behind the individual entries, which is part of why the duplicate problem existed in the first place.

    One issue we have now realised is that "bad actors" are trying to access the data...

    Happy to answer anything - methodology, false positives, data handling, whatever.