The first thing we do when looking at the summaries of a regression model is to identify which variables are the most important. In that respect, we should distinguish the importance of a variable on its own and the importance of variable as a part of a group of variables. Information theory in combination with statistics allows us to quantify the amount of information each variable provides on its own, and how much does the information provided by two variables overlap.
I will use the notion of the nomogram from two days ago to explain this on the same example of a customer walking into a bank and applying for credit. The bank has to decide whether it will accept or reject a credit. Let us focus on two variables, credit duration and credit amount. We can perform logistic regression for only one variable at a time, but display the effect function on the same nomogram. This looks as follows:

In fact, this is a visualization of the naive Bayes classifier, using loess smoother as a way of obtaining the conditional probability densities P(y|x). But regardless of that, we can see a relatively smooth almost-linear increase in risk, both with increasing duration and with increasing credit amount. In that respect, both variables seem to be about equally good, although duration is better, partly due to the problems with credit amount being leftwards skewed, so the big effects for large credits are somewhat infrequent.
But this is not the right way of doing regression: we have to model both variables at the same time. As the scatter plot shows, they are not independent:

This plot also seems to show that both of them are of comparative predictive power. But now consider the nomogram of the logistic regression model:
>
The coefficient for the credit amount has shrunk considerably! This holds even if we performed Bayesian logistic regression and took the posterior mean as a summary of the correlated coefficients. Why was the credit amount that shrank and not the duration? I find the resolution of the logistic regression model somewhat arbitrary, in the spirit of "winner takes all".
A different interpretation is to use mutual and interaction information as to clarify what is going on. Consider this summary:

The meaning is as follows:
- Duration alone explains 2.64% of the entropy of the risk.
- Credit amount alone explains 2.12% of the entropy of the risk.
- There is a 1.03% overlap between the information both of them provide.
The only problem with this approach is that one needs to construct a reliable joint model of all three variables at the same time as to be able to estimate these information quantities.
More information about this methodology appears in my dissertation.

Thanks for reminding me about nomograms ... they are a great tool and very under used ... the two articles are very good ... nice work ... this blog is getting better and better!