We use a meta-learning neural-network approach to predict measurement outcomes of a quantum state in arbitrary local bases and thus carry out an approximate quantum state tomography. Each stage of this procedure can be performed efficiently, allowing it to be used effectively on large systems. We demonstrate this approach on the most recent noisy intermediate-scale IBM Quantum devices, achieving an accurate generative model for a 6-qubit state's measurement outcomes with only 100 random measurement settings as opposed to the 729 settings required for full tomography. This reduction in the required number of measurements scales favourably, with around 200 measurement settings yielding good results for a 10 qubit state that would require 59,049 settings for full quantum state tomography. This reduction in the number of measurement settings coupled with the efficiency of the procedure could allow for estimations of expectation values and state fidelities in practicable times on current quantum devices. For suitable states, this could then help in increasing the speed of other optimization schemes when attempting to produce states on noisy quantum devices at a scale where traditional maximum likelihood based approaches are impractical.