This might be kind of naive, but you could probably get a close approximation by counting how many inputs there are and multiplying it by the avg size of a signature.
I don't know enough about txs to say how accurate this would be.
I thought about that myself, but it really would not work as signatures have different sizes depending on time(as different encodings were used) and TX type.