Under the Indian Languages Corpora Initiative phase –II (ILCI Phase-II) project, initiated by the MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi had collected Gujarati monolingual text corpus. There are approx. 30,000 sentences of general domain in this corpus. These sentences have been POS tagged and Chunked properly. The chunking guideline is provided in supporting document. This corpus has following features: unique ID, UTF-8 encoding, and text file format.