Stanford NLP Model Output for Biofuel Patent Classificationdoi:10.7910/DVN/29374Harvard Dataverse2015-03-061Kessler, Jeff, 2015, "Stanford NLP Model Output for Biofuel Patent Classification", https://doi.org/10.7910/DVN/29374, Harvard Dataverse, V1Stanford NLP Model Output for Biofuel Patent Classificationdoi:10.7910/DVN/29374Kessler, JeffHarvard DataverseHarvard Dataverse NetworkJeff Kessler2015-03-062015Biofuel ClassifierNatural Language ProcessingThis NLP model was generated using the Stanford NLP Classifier (available from: http://nlp.stanford.edu/software/classifier.shtml). The model was trained using a random selection of 700 manually classified biofuel patents from 1976 through 2013, and validated against 300 manually classified biofuel patents on January 03, 2014. Included are the classification results and associated patent numbers for both the manually trained patents, and for the automatically categorized patents.19762013United States<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>Manual Classification.csvThis is the initial list of 1000 patents manually classified for use with training and validating the NLP modeltext/plain; charset=US-ASCIIner-model.ser.gzThis is the model generated by the Stanford NLP Classifierapplication/x-gzipNLP Classification.csvThis is the list of patents and associated classifications based on the NLP model that was trained using the manually classified patentstext/plain; charset=US-ASCIIpatents_test.propThis is the property file used for parameterizing the modeltext/plain; charset=US-ASCII