With digital photography there is often a difference between "technically correct" and "visually correct" (in terms of captures).
Digital sensors are linear devices, which means a full HALF of all the information captured is recorded in the top 1 stop of the sensor's range. So in most cases (there are exceptions, but I won't go into that much detail at this stage) you want the histogram to be touching the right-hand edge (as yours appears to be), but without a "spike" right at the end (which you don't have).
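To put rough numbers on the "half in the top stop" point, here is a minimal sketch in Python. It assumes an idealised, purely linear 12-bit raw file (4096 levels); the bit depth and the function name `levels_per_stop` are my own illustrative choices, and real cameras add noise and processing that this ignores.

```python
def levels_per_stop(bit_depth=12, stops=6):
    """Count the raw levels available in each stop of a linear encoding.

    Assumes an idealised linear sensor: each stop down from clipping
    covers half the remaining range of values.
    """
    upper = 2 ** bit_depth          # total number of levels, e.g. 4096
    counts = []
    for _ in range(stops):
        lower = upper // 2          # one stop down = half the signal
        counts.append(upper - lower)
        upper = lower
    return counts


if __name__ == "__main__":
    total = 2 ** 12
    for stop, n in enumerate(levels_per_stop(12, 6), start=1):
        # Stop 1 is the brightest stop, just below clipping.
        print(f"stop {stop}: {n} levels ({n / total:.1%} of the total)")
```

Under those assumptions the brightest stop gets 2048 of the 4096 levels, the next gets 1024, and so on down, which is why pushing the histogram to the right (without clipping) keeps more of the data where the levels are.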
However, if you're capturing, say, 11 stops of dynamic range but your scene only needs about 4 to describe it (a typical reflective scene), then what should be shadows can end up biased more towards the midtones. From a technical point of view that's great; it just doesn't look very good!