Instantly Parse Academic CVs

CVParsa is the automated solution for faculty search committees. Securely and quickly extract education, employment, scholarly interests, and publication data from dossiers and summarize your applicant pool in a simple spreadsheet. Spend your time evaluating candidates, not on tedious data entry.

Get Started Now →

CVParsa capybara sitting among piles of faculty applications, with a very useful spreadsheet hovering above.

CVParsa capybara managing the data extraction process.

Core Features

Deep, Accurate, and Secure Data Extraction

CVParsa is specifically designed to understand academic data, ensuring unparalleled accuracy in parsing complex sections. Our technology goes beyond basic name and contact information to pull out the details that matter to your committee. Stop manually copying data from dossiers, and spend more time evaluating candidates.

•

Useful Data

Person, Interests, Education, and Employment information is standard in all tiers.

•

Publication Counts

Parse all listed publications, and count first and last author papers, and publications in the venues you care most about.

•

Secure Processing

All files are kept strictly confidential on our secure server, which implements AES-256 encryption at rest.

Decision Support

Understand the Applicant Pool

See the composition of your applicant pool, at a glance. CVParsa can automatically generate a visual infographic summary, giving your committee an immediate, bird's-eye view of all candidates' backgrounds and credentials.

•

Save Time

Reduce hours of manual data entry to mere minutes.

•

Evaluate Fairly

Standardized data facilitates objective, apples-to-apples comparisons.

•

Customize Output

Receive a single, clean spreadsheet for easy committee sharing. Choose which columns are included.

CVParsa capybara analyzing the resulting data.

CVParsa in Action

To illustrate CVParsa's capabilities, we collected the public CVs for 15 members of the National Academy of Science (NAS), and passed them to CVParsa.
View the resulting spreadsheet here (as a Google Sheet).

Step 1: Create a ZIP of CVs or dossiers

CVParsa can process individual CVs or entire dossiers / applications.

Step 2: Send the files to CVParsa

Drag and drop your ZIP file onto CVParsa and select the extraction tier.

Step 3: CVParsa does the work

CVParsa securely reads each PDF to extract education, employment, interests, and publications data.

Step 4: Get structured output

Download your structured summary of the applicant pool, and get to work evaluating candidates.

Step 5: Understand the applicants

Download your visual summary of the applicant pool, and see how you did with recruitment.

Ready to Save Your Committee Time?

Upload your ZIP file of CVs or full applications, and get your consolidated spreadsheet back today. Handle 1 applicant or 500, all in one go.

Process My CVs →

Flexible Processing Tiers

Basic

$3/file

Person, Interests, Education, and Employment information.
Publication extraction and metadata.
Infographic summary of applicant pool.

Select Basic

Plus

$5/file

Person, Interests, Education, and Employment information.
Publication extraction and metadata.
Infographic summary of applicant pool.

Select Plus

Premium

$7/file

Person, Interests, Education, and Employment information.
Publication extraction and metadata.
Infographic summary of applicant pool.

Select Premium

Standard pricing is per CV document processed. Multi-use licenses also available; contact licenses@cevianlabs.io for license inquiries.

Frequently Asked Questions

What is the output of the process?

CVParsa produces a spreadsheet summarizing the individual CVs that you upload. Each input file becomes a spreadsheet row, and each row details a set of characteristics extracted from the CV.

Basic tier fields: Name, Gender (inferred), Email, Current job, Current institution, Current dept., Highest degree, PhD institution, PhD advisor, PhD field, PhD year, Interests keywords, Interests summary.

Plus tier fields: All Basic tier fields plus Publication count, 1st author pub count, last author pub count, and (optionally) up three venue-specific publication counts.

Premium tier: All Basic + Plus tier fields, plus a downloadable infographic visualization of the composition of the pool.

Can I hide some fields in the spreadsheet to fit my hiring process's best practices?

Yes, you can choose to omit fields from the output file. For instance, if you want to omit the Gender column, or the PhD institution, to remove gender or prestige signals from the spreadsheet. These options are entered prior to downloading the spreadsheet.

How does CVParsa work?

CVParsa uses a custom stack of modern AI tools to automatically process the semi-structured data in academic CVs. These tools are built on 10 years of expertise by the CVParsa team in analyzing employment, education, and scholarly activity by academic faculty, and leverage industry standard security protocols to process files quickly and maintain their confidentiality throughout.

Briefly, the process works like this:

Create a .zip archive containing the stack of individual CVs or faculty dossiers in PDF format.
Upload the archive to CVParsa for processing on our secure server.
It takes about 5 minutes per 100 files, so get a cup of coffee or answer some email while you wait.
Select which output format options you want.
CVParsa notifies you when you output file is ready to download.

Can I submit one big PDF that contains many CVs?

No. CVParsa cannot split apart PDFs. Each person you want to summarize must have their own PDF. If your pool contains 100 applicants, there should be 100 files, one for each applicant. Files can be full dossiers or applications, or just academic CVs.

Can I submit DOCX or other non-PDF files?

No. CVParsa only works on PDF files at this time.

What languages does CVParsa naturally support?

CVParsa can understand documents in most major languages, and recognize international degrees, institutions, and publishing venues. CVParsa always returns results in English.

What academic fields or areas of study does CVParsa understand?

CVParsa supports all academic fields and areas of study. So long as the CV is structured like an academic curriculum vitae, it should work.

If academics in your field do not publish articles or books, the publication parser will likely produce incomplete output. For instance, at this time, it does not support parsing for artistic performances.

What information does the publication parser extract?

If a publication has a standard bibliographic structure (e.g., a journal article, a peer-reviewed conference publication, a monograph, etc.), CVParsa will try to extract it. All extracted publications are checked against the Crossref database to correct errors when possible, but all extracted publications are counted.

Are multi-use or site licenses available?

Yes. License options for securing multiple CVParsa jobs are available. License inquiries can be directed to licenses@cevianlabs.io

Does Cevian Labs accept invoices or other forms of tax-exempt payment?

Yes. In addition to the option of paying per file parsed, Cevian Labs offers site licenses to institutional partners, and supports tax exempt payments via invoice. Please contact us if you are interested in our site license options. (If using CVParsa requires Cevian Labs being an approved vendor for your university, please contact us with details about that process.)

How does CVParsa handle confidential information?

We take the confidentiality of applicant information extremely seriously. File uploads are encrypted, our secure server uses AES-256 encryption at rest to protect uploaded files, and files are never used to train third party AI models. PDFs are deleted after CVParsa finishes, and original files are retained for only 30 days in order to handle customer support requests.

What information does Cevian Labs retain?

Our data retention policy is detailed in the Privacy Policy for CVParsa. Briefly, copies of some files may be retained for internal audits, service improvement efforts, and as required/permited by law.

To use CVParsa, you must agree to the End User License Agreement (EULA), which grants us permission to use the submitted files to provide you with the resulting summaries.

How accurate is CVParsa?

Academic CVs varies widely in format, styles, and conventions across fields, subfields, and even individuals. CVParsa uses advanced natural language processing algorithms, supported by extensive domain customization to identify and extract relevant information. At the same time, no automated tool is perfect, and CVParsa does make some mistakes.

In our extensive testing on 100s of CVs spanning every academic field, CVParsa is about 99% accurate. In a stack of 100 CVs, CVParsa may make a few dozen mistakes across all the pieces of information it extracts. These mistakes are sometimes in distinguishing different types of publications, or understanding sometimes unique or unusual academic titles. If you notice any parsing mistakes, please let us know, so that we can improve CVParsa's accuracy on that particular type of extrated information.