Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

portfolio

publications

Backwards Planning from Onward Task Demonstrations via Vision-Language Models

Published in , 1900

In this paper, we propose a novel method of backward planning using visual-language models (VLM). Previous work on backward planning applied traditional methods that ignore the semantic meaning of manipulation tasks. Our proposed framework utilizes VLMs’ semantic understanding and physical reasoning capabilities to infer backward plans by analyzing onward task executions. Our method explores the barebone usage of these models and provides a comprehensive ablation study comparing the planning capabilities of common closed-source VLMs. We demonstrate that our system reaches an 80% success rate in two robotic manipulation tasks. We also observe that several state-of-the-art VLMs struggle significantly in visual understanding. This limitation still necessitates external embodiments for robust execution. However, the observed planning capabilities suggest that effective backward planning may not require highly complex architectures.

Recommended citation: Gamsız, A. F., Akkoç, D. B., Yıldırım, Y., & Uğur, E. (2025). Backwards planning from onward task demonstrations via vision-language models [Manuscript submitted for review].
Download Paper

May I Ask a Question? MIA40K: A Large-Scale Educational Conversation Dataset and Generation Pipeline

Published in , 1900

Large Language Models (LLMs) have shown significant promise in educational applications, but their full potential is constrained by the limited availability of high-quality educational dialogue data, as traditional data collection methods rely heavily on human involvement. This paper presents a fully automated, highly scalable pipeline for generating educational conversation datasets. Our multi-step framework incorporates solution generation, verification, and dialogue synthesis with LLM-as-a-judge filtering to ensure quality control. Using this pipeline, we introduce MIA40K, a dataset of 39,526 teacher-student conversations focused on mathematics and science education. We evaluate our dataset’s conversational and educational quality through standard metrics and demonstrate its utility in educational dialogue tasks.

Recommended citation: Gamsız, A. F., Köksal, A., Korhonen, A., & Schütze, H. (2024). May I Ask a Question? MIA40K: A Large-Scale Educational Conversation Dataset and Generation Pipeline [Work in progress]
Download Paper

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.