Python NLP | Codersbrain
contractual
Posted on April 11, 2025
Job Description
Python NLP
Job Summary
We are seeking a skilled Python NLP specialist to develop and implement solutions for extracting and processing data from PDFs. The role involves utilizing NLP libraries to analyze and structure text data, optimizing processes for accuracy and efficiency, and collaborating with cross-functional teams to refine data processing workflows.
Responsibilities
- Develop and implement Python-based solutions for extracting and processing data from PDFs.
- Utilize NLP libraries such as spaCy, NLTK, or similar to analyze and structure text data.
- Optimize text extraction processes for accuracy and efficiency.
- Work with structured and unstructured data formats to transform extracted data into usable insights.
- Debug and resolve issues related to text parsing and extraction.
- Collaborate with cross-functional teams to refine data processing workflows.
Qualifications
- Strong proficiency in Python programming.
- Hands-on experience with NLP libraries like spaCy, NLTK, TextBlob, or Transformers and NLP modeling.
- Experience in extracting text from PDFs using tools such as PyMuPDF, PDFMiner, or Tesseract OCR.
- Understanding of regular expressions (RegEx) for text pattern matching.
- Familiarity with data processing and text cleaning techniques.
Preferred Skills
- Knowledge of machine learning techniques for text classification and entity recognition.
- Experience with document OCR tools such as Tesseract or Amazon Textract.
- Familiarity with data storage solutions like SQL, NoSQL, or Pandas DataFrames.
- Exposure to cloud-based NLP services like Google NLP, AWS Comprehend, etc.
Experience
4 to 6 years of relevant experience in Python programming and natural language processing.
Environment
- Location: Remote
- Start Date: 15 days from offer acceptance
- Type: Not specified
Deadline
Applications must be submitted by April 14, 2025.
Skills
- Python Programming
- Natural Language Processing