The sixth-generation (6G) network is envisioned to integrate communication and sensing functions, so as to improve the spectrum efficiency (SE) and support explosive novel applications. Although the similarities of wireless communication and radio sensing lay the foundation for their combinations, their different requirements for electromagnetic signals make the joint system design a hard task. To simultaneously guarantee sensing accuracy and communication capacity, the multiple-input and multiple-output (MIMO) technique plays an important role, due to its unique capability of spatial beamforming and waveform shaping. However, the configuration of MIMO also brings high hardware cost, high power consumption, and high signal processing complexity. How to efficiently apply MIMO in the joint communication and sensing (JCAS) system is still open. In this survey, we discuss JCAS in the context of MIMO configurations. We first outline the roles of MIMO in the progress of communication and radar sensing. Then, we review current advances in both communication and sensing coexistence and integration in detail. Three novel JCAS MIMO models are subsequently discussed by introducing the promising 6G enablers, i.e., the unmanned aerial vehicle (UAV) and the reconfigurable intelligent surface (RIS). With the aim of building a compatible dual-function system, the benefits and challenges of MIMO in JCAS are summarized in each subsection. Promising solutions are also discussed from the system perspective with simple, intelligent and robust principles. In the end, open issues are outlined to envisage a comprehensive JCAS network in the near future.