Logotipo do OpenStreetMap OpenStreetMap

Ideas for a "suspect changesets classifier"

Publicado por naoliv o 17 de Maio de 2014 en English.

Sometimes we find here in Brazil some imported data from +4 months ago, that nobody saw until now. Usually, these imports are followed by some other changesets deleting the old data + changesets modifying/adjusting the imported data.

We also see some changesets where people purposely/unconsciously delete a lot of data.

Could a Bayesian filter, SVM or something else be used to classify a suspect changeset? Could we use something smart for this task?

Icona de correo electrónico Icona de Bluesky Icona de Facebook Icona de LinkedIn Icona de Mastodon Icona de Telegram Icona de X

Parola

Comentario de cartinus no 18 de Maio de 2014 ás 01:42

When using WhoDidIt you can see which changesets contain lots of deletions.

Comentario de naoliv no 18 de Maio de 2014 ás 02:18

The problem is that I can’t manually verify every changeset (and that’s why I am wanting some kind of classifier).

Comentario de Nakaner no 18 de Maio de 2014 ás 13:04

The German user Oli-Wan (a very active German forum member) developes a tool to detect vandalisms and other bad changesets. He has written about his idea/work in German forum. You may contact him in e.g. in German or English.

Comentario de cartinus no 18 de Maio de 2014 ás 19:02

That is why I mentioned WhoDidIt. Changesets with lots of deletions are specially marked. So you won’t have to check them all.

Inicia sesión para deixar un comentario