Why is real-world visual object recognition hard?

Why is real-world visual object recognition hard?