Publikationen
Zahlreiche Veröffentlichungen sind auch zum Download verfügbar: http://ssrn.com/author=1005596.
Art der Publikation: Beitrag in Zeitschrift
Overcoming poor data quality: Optimizing validation of precedence relation data
- Autor(en):
- Finnah, B.; Otto, A.; Gönsch, J.
- Titel der Zeitschrift:
- European Journal of Operational Research
- Veröffentlichung:
- 2024
- Digital Object Identifier (DOI):
- doi:10.1016/j.ejor.2024.11.009
- Link zum Volltext:
- https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4960613
- Zitation:
- Download BibTeX
Kurzfassung
Insufficient data quality prevents data usage by decision support systems (DSS) in many areas of business. This is the case for data on precedence relations between tasks, which is relevant, for instance, in project scheduling and assembly line balancing. Inaccurate data on unnecessary precedence relations cannot be used, otherwise the recommendations of DSS may turn
infeasible. So, unnecessary relations must be satisfied, diminishing the baseline problem’s solution space and the business result. Experts can validate the data, but their time is limited.
We apply an optimization lens and formulate the data validation problem (DVP). Restricted by the available time budget, an expert dynamically receives queries about specific data entries and corrects or validates them. The DVP searches for an interview policy that states queries to the expert, each using up some of the time budget, in a way that maximizes the (weighted) number of removed precedence relations. We model the DVP as a dynamic program, derive optimal policies for several important special cases and design a heuristic interview policy LSTD. In a case study of an automobile manufacturer, this policy substantially reduces the stations’ idle time after selectively addressing about 8% of the data entries.
We prove theoretically and numerically that data validation by experts can lead to significant savings. The number of queries required to validate the data exhaustively is much less than naive estimates. Additionally, the probability to remove an unnecessary precedence relation per query in a series of queries is high, even for simple interview policies.