John Blake
Resources
Course specific resources for my current students are housed on course specific websites, which are provided here. This page provides links to resources grouped topically. Tools created by me, my students, and my research assistants are listed under "Texts and Tools (TNT) lab".
Texts and Tools lab, TNT Lab
- Pronunciation Scaffolder This beta version annotates presentation scripts to help users read their script aloud. Features annotated include pausing, intonation, content words, word stress, tricky sounds and linking between words.
- Error detector This corpus-based genre-specific tool detects errors found in a corpus of information and computer science short research articles. Automated feedback is given for accuracy, brevity, clarity, objectivity and formality errors. Emoticons and an alphanumeric code are inserted into the output. Advice is displayed on hover.
- Language feature detector This tool detects various language features, including: modality (hedges, approximations and boosters), voice, pronouns and articles.
- Move visualizer This tool detects the annotations created using the UAM Corpus Tool to classify the function of each sentence according to a tagset comprising moves and submoves. The annotations are coloured and named.
- Tense identifier (probabilistic beta version). This tool identifies the tense of simple sentences (i.e. with one finite verb) using a hybrid neural network. It was developed in Golang and is deployed on Heroku. Currently, the neural network has trouble discriminating tenses that use "be".
- Preposition visualizer (beta version). This tool highlights prepositions according to their function. It was developed in Golang and is deployed on Heroku.
- Argument visualizer (beta version). This tool visualizes arguments (conclusions, premises and reasoning) and names any formal or inform fallacies present in various famous and infamous texts). This visualizer does not analyze raw text, but needs pre-annotated texts.
- Language feature visualizer. This tool visualizes language features in a pre-loaded pre-annotated corpus of short research articles and academic essays (beta release expected March 2018)
- Automated timeline generator. This tool automatically extracts the finite verb and related time phrases for a sentence, and plots the key details on a time line to help users better see the connection between time and tense. (subject to funding; if funded beta release expected September 2018)
- Online automated authorship analysis. This tool enables uses to identify the probability that an unknown text is written by the same author of a known text or texts. Markers and weights can be selected and set by the user. (fun challenging project; beta release not expected for a while)
- Academic word suggestion machine a predictive text writing tool created by Atsushi Mizumoto et al.
- Difficult and extraneous word finder tool created by Sarah Tyler
- Ginger grammar checker check short texts online. No need to download application.
- Readability checker to check whether a text is easy or difficult to understand
- LexTutor vocabulary profiler to check the balance of academic, general and specialist words in a text
- Writing and presenting scientific papers powerpoint slides from workshop by Tu Bao Ho (2008) aimed at researchers in computer science
- Guide to scientific writing aimed at chemists but particularly relevant to researchers in materials science
- 10 simple rules for getting published, article by Bourne (2005) aimed at computational biologists
- Write that journal article in 7 days, Prezi by Mewburn (2011)
- How to write like a scientist, article by Ruben (2012)
- Common errors in academic (medical) writing, guide by Michelle Biros
- How to write a great research paper, 30-minute lecture given by Simon Peton Jones (2013)
- How to report scientific research to a general audience
- Academic phrasebank commonly occurring phrases found in academic texts
- Common errors in medical writing, guide by Michelle Biros
- Professional medical writing support correlated with quality of written English, research paper published in BMJ
- Online writing tool incorporating corpus-based error detector and language feature detectors, e.g. voice, modality and article detection
- "ed" pronunciation discriminator identifies words ending with "ed" and classifies each matched word according to its pronunciation /d/, /t/ and /id/
- Lexipedia a tool that visualizes collocations of inputted word (en, es, de, fr, nl, it)
- Online graded text editor a tool that visualizes words according to frequency of usage, originally designed for authors of simplified readers)
- Textexture a tool to visualize any text as a network
- Visuwords a visual dictionary, thesaurus and interactive lexicon
- Oxford collocation dictionary
- Top ten questions for the PhD oral exam a checklist of viva issues that always come up by Dunleavy, extracted from his book Authoring a PhD, 2003.
- Move highlighter colour codes texts tagged using my IPMRC annotations in UAM corpus tool
- Parts-of-speech (POS) tag game
- Lexipedia a a tool that visualizes collocations of inputted word (en, es, de, fr, nl, it)
- Selection flowchart a flowchart to enable linguistics to decide which particular field to specialize in; an amusing oversimplification.
- British National Corpus interface for keyword statistics No subscription needed. Dougal Graham, 2014.
- Research article corpus. 6.5 million words from 39 disciples with 20 research papers per discipline. Searches online using ConcGramOnline (Chris Greaves)
- NLP conference calendar list of top computational linguistic and NLP conferences
- Corpus tools 1 Joint portal of Masaryk University`s NLP Centre and Lexical Computing, including NoSketch Engine
- Corpus tools 2 Selection of downloadable tools developed by BFSU
- Computational resources for linguistic research Bill Poser provides a huge selection of links to useful resources.
- Selection grid for OS software agrid using closed questions to enable researchers to select the most appropriate NLP software - nice guide for novices through the minefield
- TextAE an embeddable visual editor of text annotation
- Apache Open NLP a machine learning based toolkit for the processing of natural language text, i.e. an NLP pipeline.
- The GATE product family a set of tools for the general architecture for text engineering
- KH coder free software for quantitative content analysis
- Introduction to statistics for non-mathematicians / linguists
- Log-likelihood calculator (UCREL)
- Bayesian calculator (Fullerton)
- Parallel analysis engine program to determine number of factors to retain in PCA
- Web apps for R created by Atsushi Mizumoto.
- Courses
- simpleR Using R for introductory statistics (Verzani, 2001-2)
- Hands-on tutorial on R for linguistics research
- DataCamp Introduction to R online introductory course
- Quantitative corpus linguistics with R resources (Gries, 2014)
- Resources
- Web apps for R created by Atsushi Mizumoto.
- Google`s R style guide
- Word cloud in R
- Crantastic community site for R packages
- html / css / JavaScript code checker
- JavaScript adventure game
- Regular expression debugger
- Regular expression online tool
- Regular expression library
- Logic and language course using logic to understand language
- Flight finder by Exepedia Japan
- ANA reservations in English
- Japanese Transit Planner provided by Jorudan
- JR Bus Kanto schedule(UoA-Tokyo) for Yumekaido Aizu Liner
- Japan hotel and ryokan site provided by Jalan
- Amazon Japan in English
- Aizuwakamatsu restaurant guide
- Earthquake tracker
- Hourly weather forecast for Aizuwakamatsu
- Fair read by Dubhgan Hinchey. This is a repository of ebooks and audio that can be downloaded. They are formatted beautifully and can be printed.
- English Everywhere RTHK radio show (2002-2004) Recordings in Cantonese and English
- Project Gutenberg Collection of fee ebooks
- Loyal books Collection of free audio books
- Valore books Site that rents, buys and sells textbooks
- Liberal arts reading list by Kyle Pearce
(c) John Blake 2017