TODO: Add funding description page.

This page outlines the background I think one needs to be productive in my lab. My choices reflect my research focus (computational linguistics and applied mathematics) and training (PhD in computational neuroscience and MD specializing in toxicology). You don’t have to know everything here. Someone focusing on linguistics doesn’t need to know about Hilbert or Banach spaces. Mastery comes from directed learning and focused practice.

My Overall Philosphy.

Science educates humanity, in the original meaning of the word educates. The pursuit of science is a rewarding calling with purpose and intellectual fulfillment. The business of science is destructive, feeding and empowering a scientifically illiterate administrative class. This is not a uniquely modern concern. Galileo lauded his patrons at the beginning of his books, usually for about 40-50 pages. Now, from the incisive to the almost banal. As I see it, to succeed in modern science one needs, in order, luck, intellectual honesty, and realpolitick.

I strive to create a lab where people join because they are interested and inspired in joining me on a mutual voyage of discovery. I expect them not to misrepresent their talents or interests to get a spot nor to phone anything in. (Phoning in is 90’s argot that means to do a poor job because you didn’t really try. Perhaps in the era of the smart phone everyone is phoning in.) This expectation comes partly because I think interesting things happen when smart creative people are given the time to be freely productive and partly because I don’t have the bandwidth or inclination to micromanage.

Table of Contents

Fundamentals Everyone Should Know

  1. GitHub. I use GitHub to organize and share code.
  2. Missing Semester. This MIT course teaches you the day-to-day tools you need to do computational science, for example how to move and rename masses of files, quickly edit files with vim, or make a build system.
  3. LaTeX. LaTeX makes beautiful preprints and technical papers. I use Beamer for almost all my presentations. I write as many of my grants in LaTeX as possible. IEEE conferences accept TeX submissions. Biomedical journals sometimes do.
  4. Facility with Python, Prolog, LiSP, Julia, or Haskell.
  5. Anatomy of a Specific Aims Page. The Specific Aims page is the crystallization of your proposal, similar to but far more detailed and precise than an executive summary. All grant applications, whether to private foundations or the NIH, DoD, or NSF, hinge on the Specific Aims page.

Things for the Computational Linguistics People

My focus is on representing biomedical knowledge in a computable format and on developing tools that use those representation to contextualize (and especially assess the plausibility) of knowledge expressed in unstructured text.

Knowledge Representation

  1. Basic Formal Ontology. BFO is a framework for creating first order logic predicates that, when taken together, express a consistent picture about a “portion of reality”. It’s a commonly used standard.
  2. Markov Logic Networks. MLNs express uncertainty and are thus more expressive than BFO. It is an area of ongoing interest to join a BFO-compliant ontology and MLN schema.
  3. SpaCy and Prodigy. They are our NLP backbone.

Things for the Applied Mathematics People

Spatial Statistics