I Tried 5 AI Tools Used in Space Research, Here’s What Happened

By My Code Diary

I have a confession. Three months ago, I fell down a rabbit hole at 2 AM reading about how NASA was using machine learning to sift through telescope data. One hour turned into four. By the time I looked up, I had a dozen browser tabs open and a half-cold cup of tea next to my keyboard.

That night planted a question I couldn’t shake: what would it actually feel like to use the AI tools that real space researchers use? Not just read about them, actually run them, break them, and figure out what they’re doing under the hood.

So I spent two weeks doing exactly that. I picked five tools that are genuinely used in planetary science, astrophysics, and space mission planning. Some were brilliant. Some humbled me immediately. All of them taught me something I hadn’t expected.

Here’s an honest account of what happened.

Pro Tip: The fastest way to understand a new AI tool is to feed it data you already understand. If the output surprises you, dig in. If it matches your expectations perfectly, push it harder.

1. Astroquery, Talking Directly to Space Databases

The Problem: Astronomical data is scattered across dozens of observatories and archives. Downloading it manually is tedious, error-prone, and frankly, the kind of work that makes you question your life choices.

What I Expected: A simple wrapper that fetches star catalogs.

What I got: An entire ecosystem.

astroquery is a Python library that lets you query live astronomical databases, NASA’s MAST archive, the ESA Hubble archive, SIMBAD, NED, and more directly from your code. No browser. No clicking. Just structured Python queries that return real mission data.

from astroquery.mast import Observations

obs = Observations.query_object("M31", radius="0.2 deg")
products = Observations.get_product_list(obs[:3])
print(products["productFilename", "size", "dataproduct_type"])

Within five minutes, I was looking at a table of Hubble observations of the Andromeda Galaxy. Actual Hubble data. On my laptop. That moment genuinely stopped me for a second.

The sharp edge came when I tried to filter by instrument. The documentation assumes you already know what HST/ACS or JWST/NIRCam means. If you don’t have some astronomy background, you’ll spend more time on the glossary than the code. Which isn’t a criticism, it’s a professional tool used by professional researchers. It just means the learning curve isn’t in Python; it’s the domain.

What I Learned: Tools built for domain experts have deliberate friction. That friction is often a feature; it forces you to understand the data before you touch it.

2. SpaceML’s Earthrise, Training Vision Models on Satellite Imagery

The Problem: Identifying features in satellite images (cloud cover, land use, storm patterns) is a core task in Earth observation science. Labeling thousands of images by hand is not a realistic option.

What I Expected: A pretrained model I could just call and get results.

What I got: A lesson in transfer learning.

SpaceML’s Earthrise library provides access to curated satellite image datasets and pretrained models specifically designed for Earth observation tasks. The real value is the dataset curation of satellite imagery from NASA and ESA missions, already labeled and structured for training.

from fastai. vision.all import *

path = untar_data(URLs.PETS)  # swap in your satellite dataset
dls = ImageDataLoaders.from_folder(path, valid_pct=0.2, seed=42)
learn = vision_learner(dls, resnet34, metrics=error_rate)
learn.fine_tune(3)

I used a fastai backbone with one of the Earthrise-structured datasets and fine-tuned a ResNet34 to classify cloud types from multispectral imagery. The transfer learning worked better than I expected. Three epochs of fine-tuning on a relatively small labeled set got me to about 87% validation accuracy.

The harder problem was interpreting the output confidently. A model that says “cumulus” with 91% confidence is not the same as a meteorologist who says it. Understanding the calibration of a classification model, not just its accuracy, is something a lot of tutorials gloss over.

What I Learned: Transfer learning from natural images to satellite imagery works surprisingly well. But always look at your confusion matrix before you trust your model.

3. Lightkurve, Finding Exoplanets with Python

The Problem: Detecting exoplanets means looking at tiny periodic dips in stellar brightness data. The signal is real, but so is the noise.

What I Expected: A signal processing library with some astronomy flavor.

What I Got: The closest I’ve ever come to actually doing astrophysics.

Lightkurve is a Python library developed by the Kepler/K2 and TESS mission teams for analyzing stellar light curve data. You can download real photometry data from real stars and look for transit signals yourself.

import lightkurve as lk

search = lk.search_lightcurve("Kepler-10", mission="Kepler")
lc = search[0].download()
lc_clean = lc.remove_outliers().flatten()
lc_clean.fold(period=0.8375).scatter()

I ran this on Kepler-10, a star with a confirmed exoplanet orbiting it every 0.84 days. When I folded the light curve at the correct period, and the transit dip appeared exactly where it should, I felt something I wasn’t expecting. Not just curiosity. Something closer to awe.

The challenge was distinguishing real transits from instrumental noise. Lightkurve has a full Box Least Squares periodogram built in for this. But understanding why BLS works requires you to think carefully about the statistics of periodic signal detection. This is where most beginner tutorials quietly skip to, “and then we found the planet.” The actual work is in that gap.

What I Learned: The tool is elegant. The hard part is always interpretation. Any signal you find needs a physical story before you trust it.

4. NASA’s Open Science Data Repository API, Automating Literature Search

The Problem: Space research generates enormous volumes of publications. Keeping up with relevant literature manually is practically impossible at scale.

What I Expected: A basic REST API that returns paper metadata.

What I got: A genuinely useful automation skeleton.

NASA’s Science Discovery Engine exposes a REST API that lets you programmatically search the NASA Technical Reports Server, the Astrophysics Data System (ADS), and related repositories. Pair it with an LLM, and you have the skeleton of a real research assistant.

import requests, openai

query = "JWST early universe galaxy formation 2024"
url = f"https://sciencediscoveryengine.nasa.gov/api/search?q={query}&rows=5"
results = requests.get(url).json()

titles = [r["title"] for r in results.get("docs", [])]
prompt = f"Summarize these research directions in 3 sentences:\n{titles}."

response = openai.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": prompt}],
    temperature=0.2
)
print(response.choices[0].message.content)

What surprised me was how much signal was in the metadata alone, titles, keywords, author clusters, and citation patterns. You don’t always need to read the full paper to understand where a research community’s attention is pointing.

The API has rate limits, and the documentation occasionally lags behind the actual endpoint behavior. Budget time for that. Every external API will do this to you at some point.

What I Learned: Combining a domain-specific data API with a general-purpose LLM is genuinely powerful and easier to build than most people think. The complexity isn’t in the AI. It’s in cleaning and structuring what comes back from the API.

5. PySyft + Space Data, Federated Learning Without Leaving Earth

The Problem: Space agencies often can’t share raw mission data across institutions for legal or operational reasons. But they still want to train shared models.

What I Expected: This one to be too academic to be practical.

What I got: An unexpected answer to a real engineering problem.

PySyft is a library for privacy-preserving machine learning, specifically, federated learning, where a model is trained across multiple data sources without the data itself ever being centralized. Space agencies have begun exploring this for collaborative science across borders.

import syft as sy

node = sy.orchestra.launch("space-node", port=8080, reset=True)
client = node.login(email="researcher@agency.gov", password="demo")

data = client.upload_dataset(
    name="Kepler Photometry Subset",
    data={"light_curves": lc_array},
    description="Stellar light curves for collaborative transit detection."
)

I ran a simplified simulation of two local “nodes” representing two fictional observatories, each holding different slices of a dataset. The federated model was trained on both without either node sharing raw data with the other. The aggregated model was measurably better than either local model alone.

Is this plug-and-play for production? Absolutely not. Setting up a real PySyft network involves infrastructure decisions that go well beyond a weekend project. But the conceptual shift it forces is valuable on its own: thinking about data not as something you download, but as something you visit.

What I Learned: Privacy-preserving ML is closer to production-ready than I expected. And the problems it solves are real, not just academic.

What Ties All Five Together

Every one of these tools was built to solve a specific problem that someone was actually stuck on. Not “let’s see what this new library can do.” Someone needed to process photometry data at scale, Someone needed to query fifty archives without writing fifty different HTTP clients. Someone needed to collaborate without centralizing sensitive mission data.

That’s the pattern worth stealing. Good tools and good projects start with a specific frustration.

The other thread running through all five: the AI is rarely the hard part. What’s hard is understanding the data well enough to know when the model is lying to you. A 91% accurate classifier on satellite imagery is only useful if you know what the 9% failure mode looks like, and why it fails there.

Start with the problem. Learn the data before you trust the model. And if you find yourself at 2 AM with four browser tabs open and cold tea next to your keyboard, you’re probably on the right track.

Drop your questions in the comments.