اوپن سٹریٹ میپ دا لوگو اوپن سٹریٹ میپ

Ideas for a "suspect changesets classifier"

ایہہ ؜17؍؜May ؜2014ء‬ English وچ «naoliv» لیکھ چھپیا گیا سی۔

Sometimes we find here in Brazil some imported data from +4 months ago, that nobody saw until now. Usually, these imports are followed by some other changesets deleting the old data + changesets modifying/adjusting the imported data.

We also see some changesets where people purposely/unconsciously delete a lot of data.

Could a Bayesian filter, SVM or something else be used to classify a suspect changeset? Could we use something smart for this task?

Email icon Bluesky Icon Facebook Icon LinkedIn Icon Mastodon Icon Telegram Icon X Icon

Discussion

ایہہ ؜18؍؜May ؜2014ء ‪01:42‬ تے «cartinus» ٹپݨی کیتی گئی سی۔

When using WhoDidIt you can see which changesets contain lots of deletions.

ایہہ ؜18؍؜May ؜2014ء ‪02:18‬ تے «naoliv» ٹپݨی کیتی گئی سی۔

The problem is that I can’t manually verify every changeset (and that’s why I am wanting some kind of classifier).

ایہہ ؜18؍؜May ؜2014ء ‪13:04‬ تے «Nakaner» ٹپݨی کیتی گئی سی۔

The German user Oli-Wan (a very active German forum member) developes a tool to detect vandalisms and other bad changesets. He has written about his idea/work in German forum. You may contact him in e.g. in German or English.

ایہہ ؜18؍؜May ؜2014ء ‪19:02‬ تے «cartinus» ٹپݨی کیتی گئی سی۔

That is why I mentioned WhoDidIt. Changesets with lots of deletions are specially marked. So you won’t have to check them all.

ٹپݨی چھڈݨ واسطے لوگ‌این کرو