Practical Considerations for Fine-Tuning BERT-Based Language Models in Health Research: Lessons from Classifying Anti-Vaccine Posts on Social Media

For those transitioning from traditional machine learning to LLMs, it is crucial to understand how choices in data collection and training-including keyword selection, fine-tuning data quantity, stopping conditions, collapsing categories, and labeling conflicts-can significantly impact model performance, especially with smaller datasets.

A Time for Action: Recommendations for Improving Texas Immunization Rates (2016)

This publication is the culmination of more than six months of research – gathering input and feedback from hundreds of immunization stakeholders at town hall meetings and via a survey on the challenges and opportunities they experience as they work to protect Texas communities from vaccine-preventable diseases. The feedback collected, as well as information gathered from state and national government agencies and peer-reviewed papers, is presented in this report as a means to shed light on the state of immunizations in Texas, as well as serve as a road map for all immunization stakeholders – healthcare providers, public health personnel, and legislators alike – on what can be done to protect and improve immunization rates in Texas communities.