Handling human stubbornness when people think they are smarter than data science!

Last month, we had a problem-solving day talking about handling human stubbornness during the implementation of data science. You may hear data science could make good decisions, like data science help groceries to make a better decision. And you may be also familiar with the travel route planned by Google map! Like us, delivery companies also plan an optimized route for drivers. For example, drivers for delivery companies (e.g. Amazon) have to deliver hundreds of parcels to many different addresses every day. And these companies use vehicle routing models to compute the best routes for delivery drivers (maybe the routes has the least time).

At left is a delivery route computed by the Last Mile team’s optimization software, at right the route that a delivery driver actually chose to drive. (Map details have been omitted.) Green symbols *(A and B)* indicate the driver’s starting locations, purple symbols *(also A and B)* the ending locations. – cited from Amazon

However, usually, drivers deviate from the optimised delivery route computed by data science to reduce the journey time. This is because usually, they think they are more familiar with this area and have their own driving habits. Unfortunately, most time, they increase the time required to unload packages from the van at stops.

Stubbornness Illustrations ~ Stock Stubbornness Vectors | Pond5

Data scientists always consider how accurate their model is but paying less consideration to monitor the implementation of the whole process. However, make sure the whole process as the plan is much harder than designing an efficient algorithm. Recently, Amazon and MIT hold a new competition, that they want to find a solution that reducing the probability of drivers’ deviation.

Our group discussed this problem and considered 2 possible ways to help improve the drivers’ loyalty:

Specific personalised driver routes

Most drivers are deviating from the route plan since they have their own driving habits. So, why not design an optimised route combined with their driving habits?

Skoda trials in-car alerts for discounts on fuel, food and more | The Scotsman

We could collect information:

Driver information: driving experience, driving years, age, familiarity with delivering area, customer satisfaction, etc…..
Feedback from the driver: satisfaction about the route plan, reasons for deviations, unusual traffic report, etc…
GPS data: track the deviation, estimated optimal time and real-time.

With these data, reinforcement learning could learn how to design an optimised route incorporating drivers’ preference.

Reward and penalty system

This idea is not related to techniques but psychology. It is also useful if we could set a reward and penalty system:

Positive Discipline – Don`t let your self be so concerned of raising a good kid that you forget you already have one.

•Reward drivers for loyalty to prescribed routes and reward drivers who deviate the route but finish their travel in a time shorter than optimised time.

•Penalties for deviations which cause delays

We only came up few ideas during the problem-solving day. There still are other good ideas to tackle this issue, and the competition of Amazon is operating now! During the discussion, we are given a video, and we found this is really helpful! (after 32 minutes, this video talks about the specific personalized route planning)