Thomas HEITZ
|
2016/ Data Scientist at
Qwam CI: Document Management System, Named Entities Recognition,
Sentiment Analysis, Text Analytics. Tomcat, Java, Groovy, JQuery, MySQL,
Solr, Elastic Search, Kibana, Gate, OpenNLP, d3js, Linux, 10 persons.
2012/2016 - 4 years Data Scientist and Web sites for
companies,
freelance: Facelec, paging,
structuring, photography, photo retouch. HTML, CSS, Javascript.
2010/2011 - 1.5 years Software development
for Ontotext, a leading developer of
semantic technology: name/place/person
extractor, recipes
cooking times/techniques extractor. GATE framework, Java, OWL, UIMA,
Maven, SVN, Agile Scrum, 50 persons.
2007/2009 - 2 years Software development and research, University of Sheffield: work on the GATE framework, one of the main NLP open source frameworks,
ex.
patents search engine (demo). Java,
Grails, GWT, Eclipse, Idea, Ant, SVN, Agile Scrum, 20 persons under the direction of Hamish Cunningham.
2005/2008 - 3.5 years Ph.D. in text mining, Paris-South University: publications, organization
of workshops, development of GATE plugins for text
preprocessing. Java, Perl.
2003/2004 - 8 months Software development and research,
Paris-South University: EXIT, terms extractor used by several research teams
in the world. Java.
2003 - 6 months Software development and research,
Paris-South University: syntactic relations analyzer for French and
English. Perl, 2 persons.
Text Mining: GATE framework,
Regular expressions, Perl, OWL, Brill.
Programming languages: strong kills in Java, HTML, XML,
CSS, Javascript and good knowledge in Grails, GWT, PHP for the most recently
used.
Systems & Softwares: Linux, Windows, MacOS. SVN,
Ant. MySQL, Oracle. Eclipse, IntelliJ Idea. Gimp, OpenGL. LaTex, OpenOffice.
Other: very good knowledge of W3C specifications, attended UI
course and read books like Face 3: The Essentials of Interaction Design.
Languages: French (mother tongue), English
(fluent), German (good), Czech (intermediate), Spanish (beginner),
Korean (beginner), Russian (beginner).
2008 Ph.D. (3.5 years) in computer science, text mining, Paris-South University, under the direction
of Yves
Kodratoff
2004 Research Master's Degree (2 years) in
computer science, second-class honors/cum laude (GPA of 13 out of 20), text
mining, Paris-South University
2002 Bachelor's Degree (3 years) in
computer science, second-class honors/cum laude (GPA of 13 out of 20),
Paris-South University
Reviewed papers in international conferences
2006. Modélisation
du prétraitement des textes.
T. Heitz.
In Proceedings of JADT'06 (International Conference on
Statistical Analysis of Textual Data), volume 1, pages 499-506, 2006.
2004. EXIT: Un
système itératif pour l'extraction de la terminologie du domaine à partir de
corpus spécialisés.
M. Roche, T. Heitz,
O. Matte-Tailliez, Y. Kodratoff. In Proceedings of JADT'04
(International Conference on Statistical Analysis of Textual Data), volume
2, pages 946-956.
Reviewed papers in international workshop
2008. Large-scale,
Parallel Automatic Patent Annotation.
Agatonovic, M.,
Aswani, N., Bontcheva, K., Cunningham, H., Heitz, T., Li,
Y., et al. In Proceedings of 1st International CIKM Workshop on Patent
Information Retrieval - PaIR'08. Napa Valley, California, USA, pages
1-8. Admission rate: 7/16.
2004-2007 Member then co-responsible of the organization committee of the first french text mining challenge DEFT in 2005, 2006 and 2007, 6 to 10 persons. From the finding of the challenge topic and rules to the organization of the workshop through the web site creation, the preparation of the corpora and the evaluation of results.
2005-2006 92.5h
including Master Complementary Competence in Computer Science: Analysis and
Algorithmics, Master 2 Pro: Knowledge Extraction in Texts,
Master 2 Research: Text mining and machine learning.
2004-2005 50h
including Technological University Diploma of Computer Science 2nd year:
Analysis and Conception of Information Systems, Java Programming Project
Writing articles about
knowledge management and classification.
Learning and speaking various languages every week in international meetings.
Developing skills in graphic design and interaction design.
Very good level in 10k running.