Thomas HEITZ
Paris, France.
Mailthomashtz123@gmail.com
Webhttp://thomasheitz.free.fr/

PROFESSIONAL EXPERIENCE

2016/ Data Scientist at Qwam CI: Document Management System, Named Entities Recognition, Sentiment Analysis, Text Analytics. Tomcat, Java, Groovy, JQuery, MySQL, Solr, Elastic Search, Kibana, Gate, OpenNLP, d3js, Linux, 10 persons.
2012/2016 - 4 years Data Scientist and Web sites for companies, freelance: Facelec, paging, structuring, photography, photo retouch. HTML, CSS, Javascript.
2010/2011 - 1.5 years Software development for Ontotext, a leading developer of semantic technology: name/place/person extractor, recipes cooking times/techniques extractor. GATE framework, Java, OWL, UIMA, Maven, SVN, Agile Scrum, 50 persons.
2007/2009 - 2 years Software development and research, University of Sheffield: work on the GATE framework, one of the main NLP open source frameworks, ex. patents search engine (demo). Java, Grails, GWT, Eclipse, Idea, Ant, SVN, Agile Scrum, 20 persons under the direction of Hamish Cunningham.
2005/2008 - 3.5 years Ph.D. in text mining, Paris-South University: publications, organization of workshops, development of GATE plugins for text preprocessing. Java, Perl.
2003/2004 - 8 months Software development and research, Paris-South University: EXIT, terms extractor used by several research teams in the world. Java.
2003 - 6 months Software development and research, Paris-South University: syntactic relations analyzer for French and English. Perl, 2 persons.

COMPETENCES

Text Mining: GATE framework, Regular expressions, Perl, OWL, Brill.
Programming languages: strong kills in Java, HTML, XML, CSS, Javascript and good knowledge in Grails, GWT, PHP for the most recently used.
Systems & Softwares: Linux, Windows, MacOS. SVN, Ant. MySQL, Oracle. Eclipse, IntelliJ Idea. Gimp, OpenGL. LaTex, OpenOffice.
Other: very good knowledge of W3C specifications, attended UI course and read books like Face 3: The Essentials of Interaction Design.
Languages: French (mother tongue), English (fluent), German (good), Czech (intermediate), Spanish (beginner), Korean (beginner), Russian (beginner).

EDUCATION

2008 Ph.D. (3.5 years) in computer science, text mining, Paris-South University, under the direction of Yves Kodratoff
2004 Research Master's Degree (2 years) in computer science, second-class honors/cum laude (GPA of 13 out of 20), text mining, Paris-South University
2002 Bachelor's Degree (3 years) in computer science, second-class honors/cum laude (GPA of 13 out of 20), Paris-South University

MAIN SCIENTIFIC PUBLICATIONS

Reviewed papers in international conferences
2006. Modélisation du prétraitement des textes.
T. Heitz. In Proceedings of JADT'06 (International Conference on Statistical Analysis of Textual Data), volume 1, pages 499-506, 2006.
2004. EXIT: Un système itératif pour l'extraction de la terminologie du domaine à partir de corpus spécialisés.
M. Roche, T. Heitz, O. Matte-Tailliez, Y. Kodratoff. In Proceedings of JADT'04 (International Conference on Statistical Analysis of Textual Data), volume 2, pages 946-956.

Reviewed papers in international workshop
2008. Large-scale, Parallel Automatic Patent Annotation.
Agatonovic, M., Aswani, N., Bontcheva, K., Cunningham, H., Heitz, T., Li, Y., et al. In Proceedings of 1st International CIKM Workshop on Patent Information Retrieval - PaIR'08. Napa Valley, California, USA, pages 1-8. Admission rate: 7/16.

ORGANIZATION OF THE RESEARCH

2004-2007 Member then co-responsible of the organization committee of the first french text mining challenge DEFT in 2005, 2006 and 2007, 6 to 10 persons. From the finding of the challenge topic and rules to the organization of the workshop through the web site creation, the preparation of the corpora and the evaluation of results.

TEACHINGS

2005-2006 92.5h
including Master Complementary Competence in Computer Science: Analysis and Algorithmics, Master 2 Pro: Knowledge Extraction in Texts, Master 2 Research: Text mining and machine learning.
2004-2005 50h
including Technological University Diploma of Computer Science 2nd year: Analysis and Conception of Information Systems, Java Programming Project

OTHER ACTIVITIES

Writing articles about knowledge management and classification.
Learning and speaking various languages every week in international meetings.
Developing skills in graphic design and interaction design.
Very good level in 10k running.