/r/q?req.query.q -- Subreddit Search

31,245 Subscribers

does omitted variable bias affect the intercept?

In a model with an intercept, how is the intercept affected by the omitted variable bias if it does at all. Assume a model has an intercept and two variables but the estimated model only uses the intercept and one variable.

2 Comments

2025/02/02
08:56 UTC

Ecological inference

Hello everyone I am looking for some guidance on getting started on ecological inference. Basically i have aggregate data (let's say on county wise voting) and need to make group level inference (let's say racial group).

I have seen some of the work by Gary King but that's several years old and am not sure what's recent and acceptable models/where to get started without getting lost in the weeds. Would really appreciate some help! Thank you so much.

Also -- if there are better ways to do this, that would be fine too! Please help a fellow academic out.

0 Comments

2025/01/31
05:44 UTC

What disciplines should I take between Linear Programing, Data Processing and Computing Finances?

Hi guys, I study Economics and want to be prepared enough to get DS roles focused on econometrics

The current disciplines I studied/will study are:

3 semesters of calculus (my calculus classes are strange, I studied limits, derivatives, integration, multivariated derivatives with optimization problems, and a little bit of linear algebra)

2 semesters of Probability and Statistics, econometrics, panel data econometrics, time series econometrics and Multivariated Analysis.

Those are my current quantitative disciplines

I now need to fill 2 optional disciplines in my curriculum. I'm deciding between:

Data Processing Linear Programming Computing Finances.

I'm studying/studied SQL, Excel, Power BI, Python, R, Algorithms and Data Structures, and some Data Engineering things by myself.

Do you guys think I'm missing any other fundamental discipline that I should search for in my university to take as option? What of the three options above u guys think is best for a data scientist that works with econometrics?

Thx in advance

13 Comments

2025/01/30
22:36 UTC

Is there any application of Martingale theory in economics/econometrics?

8 Comments

2025/01/30
21:34 UTC

Coding help: massive spatial join

Hello. I am a undergrad economist working on a paper involving raster data. I was thinking if anyone can tell me whats the most efficient way to do a spatial join? I have almost 1700000 data points that has lat and long. I have the shapefile and I would like to extract the country. The code I have written takes more than 15 mins and I was thinking if there is any faster way to do this.

I just used the usual gpd.sjoin after creating the geometry column.

Is there any thing faster than that? Please any help would be appreciated.

5 Comments

2025/01/30
01:26 UTC

Which unit root test and cointegration test should I do ?

Dear community, I have n=5 and t=8. And i found that there is presence of CSD and that my panel is homogeneous. In this case, usually we do CIPS and CADF unit root tests, but since my t=8 is very short, it's impossible to do them so please which unit root tests should I do?

3 Comments

2025/01/29
20:31 UTC

Search for an article (index multiplied by some parameter)

Hi, please help find an econometric article, where the dependent variable will be an index (any) and will be multiplied by some other parameter. Standard proportions like GDP/population are not appropriate. If you know any sources, please link below.

2 Comments

2025/01/28
19:41 UTC

Need Help Understanding Name of Event Studies for Master's Project

Hi,

My group is planning to conduct an event study examining the impact of the signing of the Paris Agreement on the share prices of Renewable Energy and Conventional Energy firms (for environmental economics). We had a meeting with our lecturer, who mentioned that event studies in economics have been renamed several times and advised us not to confuse them with the finance version.

One of our teammates took notes during the meeting and wrote down "2-way fixed event study." However, when I search for this term, the only thing that comes up is "Two-Way Fixed Effects Models." Is this the same thing, but with the name merged into one?

Sorry if this is a silly question.

7 Comments

2025/01/27
00:02 UTC

I can’t run the Slope heterogeneity test on Rstudio

Dear community, I cant run any of the slope heterogeneity test, CADF CIPS nuit root tests and westerlund cointegration tests on Rstudio. I cant find the codes I need to use to get the results and I ask ai it tells me that I should move to python or stata since the "linearmodels" package is unavailable for Rstudio. Is that right ? Can someone help please

0 Comments

2025/01/26
14:24 UTC

Struggling econometrics student

Hi guys,

I started studying econometrics and operations research at eur this year and have struggled so far (5.9, 5.7, 6.4, 8.5 /10 in the classes we’ve taken so far). I’m a bit worried that i’ll struggle to find a job afterwards due to my shaky understanding of the content and a lot of people i know have done well so far. Should i switch degrees?

3 Comments

2025/01/25
23:37 UTC

Help please

I am trying to develop an econometric model but I have a problem, I have the S&P 500 as my dependent variable and interest rates, industrial production, CPI and volatility index as my independent variables. My data has a monthly periodicity and my idea was to make a VAR Model but I am not sure, should I use this type of model or there is another that is better?

2 Comments

2025/01/25
22:34 UTC

Data Visualization - building time series models using decision trees

hey - new to this subreddit and not 100% sure if I'm in the right space. I'm building technology to run "what if" scenario analysis. Think Excel meets decision-trees but in the Multiverse. You can run 100s, if not 1000s of time series simulations for everything from financial data through to operational and strategic data. Still a work in progress but publishing more and more "nodes" each week to add to our library of supported use cases.

Honestly - just looking for feedback from folks that are deep into this kind of analysis and that would be interested in helping guide where I go with this approach next. We've built a hammer... and everything looks like a nail ;)

Challenge is that, at least for now, it is too different for the CFO folks (they love their spreadsheets).

Too complex for the average person trying to run rent vs buy simulations.

Looking for that goldilocks zone where there is enough complexity to warrant learning a new tool and where the demographic is technical enough to look past UI/UX and understand/welcome the potential of the tech.

My background is Industrial Engineering but spent 25 years in the VFX industry - so very focussed on running capacity scenarios. What if we land 80% of the projects we're bidding? 100%? When do they start? Do we have enough staff of the various roles to meet the need? Can we deliver on time? Starting to expand out into other capacity needs like compute resources or inventory.

I guess the ask is whether this is the right place to hang out or if there are suggestions on communities that might be more hyper-focussed on this type of technology and solutions?

https://preview.redd.it/jkx8juuhj6fe1.png?width=3456&format=png&auto=webp&s=85e7abd9f53ea9906778ae37615dea788437a6c9

3 Comments

2025/01/25
18:05 UTC

Effect of Time in Panel Data

Hi, I am currently running a panel regression, but I wanna ask how I can quantify or simply know if time has an effect in my dependent variable.

Someone told me to run a time fixed effect and use an F test to the time specific effects. But I think my prof want me to know if time as an independent variable is significant, or has an effect.

Help me pls, Im new in panel regression. Also if this helps, I can use both R and stata.

3 Comments

2025/01/25
06:35 UTC

How to interpret a a VAR model with logged and % variables

Hello everyone, I am really in need of anyone's help as it proves for me to be quite a challenge to interpret my results.

For econometrics purposes, I have estimated a VAR model using R, which gave me the following results.

However for my model, I used logreturns, and simple returns for my variables (SR and SPR are in the form of ln = (t/t-1) ), and percentage changes in absolute value for the otheres ( CR, R, L and R are in the form of 0.03 for a 3% change for example).

As such, I am not sure how I should interpret my results. For example, does a 1% (0.01) change in R means that the impact of R on the new SR return will be:
SR t+1 = -0.4795 * 0.01 = -0.004795 ( in logarithm)
or
SR t+1 = 1 - exp( -0.4795 * 0.01) = (-)0.004783 (meaning a decrease in the return of -0.48%)

I use the natural logarithm, and would thank the persons who answer as much as I could

https://preview.redd.it/rh8clfbuczee1.png?width=553&format=png&auto=webp&s=0170945c96de449a9ce3412ac2247299fedac43b

0 Comments

2025/01/24
17:54 UTC

DML researchers want to help me out here?

Hey guys, I’m a MS statistician by background who has been doing my masters thesis in DML for about 6 months now.

One of the things that I have a question about is, does the functional form of the propensity and outcome model really not matter that much?

My advisor isn’t trained in this either, but we have just been exploring by fitting different models to the propensity and outcome model.

What we have noticed is no matter you use xgboost, lasso, or random forests, the ATE estimate is damn close to the truth most of the time, and any bias is like not that much.

So I hate to say that my work thus far feels anti-climactic, but it feels kinda weird to done all this work to then just realize, ah well it seems the type of ML model doesn’t really impact the results.

In statistics I have been trained to just think about the functional form of the model and how it impacts predictive accuracy.

But what I’m finding is in the case of causality, none of that even matters.

I guess I’m kinda wondering if I’m on the right track here

2 Comments

2025/01/24
17:14 UTC

Historical car price data per brand/ model in Germany

Pretty specific request here but I’m sort of at a loss: I am doing a research project on the extent to which eu tariffs on Chinese ev’s are inflationary, the country of interest is Germany.

What I am looking for is prices for all EV’s listed in Germany in 2023-4 and at the start of this year after the tariffs have been implemented. In other words, a BYD dolphin sold for x in 2023 and the price rose to y in Jan 2025, the same for Volkswagen, Citroen, ford, basically all of them.

Does anyone know if there is a database or website that hosts this kind of info? Eurostat, as well as federal German publications don’t have this level of granularity.

Thank you!

1 Comment

2025/01/23
16:21 UTC

Econometrics v AI / ML

Hello, I've recently started getting into AI and ML topics, having had an economics background. Econometrics has been around since the early 20th century and AI and ML seem to draw a lot from that area. Even senior practitioners of AI/ML also tend to be much younger (less tenor).

Curious what everyone thinks about this. Are there valid new ideas being generated or is it the "old" with more available computing power now added. Would you say there is some tension between practitioners of AI / ML and senior quantitative econometricians?

21 Comments

2025/01/23
15:31 UTC

Help needed for SPSS project on Swap Spreads

I am currently working on my thesis on the leading indicator function of interest rate swap spreads on macroeconomic indicators. Unfortunately, I can't get any further with the statistics and would really appreciate potential help. The basic idea is to run regressions, but I don't know how to calculate the lead. Cross-correlation, directly in the model, both? Thanks in advance.

1 Comment

2025/01/23
11:39 UTC

Econometrics program

What is the most commonly used econometrics program in the market?

14 Comments

2025/01/23
10:25 UTC

Econometrics and Operational Research in the Netherlands?

Hi r/econometrics

I'm in doubt which BSc to do. I really wanted to do Computer Science at TU Delft first but now I'm doubting my decision. Maybe I wanna do Erasmus University Rotterdam Econometrics or Applied Maths at Delft (which offers a Finance minor for all BSc). I could also try to do Computer Science again next year but I really want to get into Operational Research later.

Alternatively I could try to get into Aerospace Engineering since I'm enrolled in the selection procedure. Or I could study Econometrics at a uni thats more close to me like VU or University of Amsterdam.

And I would love to get in Tinbergen Institute one day but I don't know which program is best. Which is why I'm interested in TUD studies too. They mention Physics on their website. EUR Econometrics and TUD CS or AE are in English so that would set me up for the language level.

I really just want to get into finance but I don't know what route to take. But I was mainly very passionate about Computer Science and Engineering already too and now I just want to stick to TU Delft or alternatively maybe Erasmus.

8 Comments

2025/01/22
23:01 UTC

Can I get admisson to an msc. in econometrics or statistics in Europe

2 Comments

2025/01/22
18:34 UTC

Should I use 2SLS?

I’m estimating the likelihood a client will accept a quote for decoration work. In my company there is no standard pricing strategy so some managers will price more on one job than the other.

Would it be worth estimating the price as a function of the quote parameters (paint, surface area, plasterboard etc) and using this estimate as the price for the logit regression?

Would no have to check if the residual distribution from the price estimation is normal?

I’m new to econometrics so please help if possible.

9 Comments

2025/01/22
18:16 UTC

Question about non-representative administrative levels in household survey and regression covariates

Hi everyone,

I have a question regarding the implications of having non-representative admin. levels when running a regression with household surveys.

I have datasets which are representative at national and regional levels, but not counties. We want to run regressions where the obs. unit is the household, and one covariate we want to add is temperature shocks at the county level.

However, a colleague (not an statitian nor econometrician) says this is not possible because data is not representative at the county level. However I've seen countless papers use IVs and covariates at lower non-representative levels without issue.

I'd like to understand if this holds some truth in it. I don't think it would invalidate an entire regression. What I would be inclined to think is that, in counties which are not properly represented, if I changed the surveyed household, the impact of climate on that specific observation could change greatly, so for example if 60% if my counties are not represented properly at all and there's great variance, then results might change if I surveyed other HHs randomly.

I'm more of an intermediate-level econometrician, but I was never taught about these topics.

Thanks in advance

0 Comments

2025/01/22
11:27 UTC

In MLR, intuitively, why does zero conditional mean assumption imply that x and u are uncorrelated?

For reference, I am working through Wooldridge's Introductory Econometrics textbook. Part of the Gauss-Markov assumptions is that E(u|x)=0. As part of the derivation of OLS, we use the fact that E(u|x) = E(u) = 0 which means that cov(x,u) = 0. But I've been taking this fact for granted. I still don't intuitively understand why we assume that x and u are uncorrelated given the zero conditional mean.

This brings me to another question. Why does Wooldridge say cov(x, u) = 0 instead of, say, corr(x, u)? In the simple linear regression setting, why is the estimated slope parameter cov(x, y) / var(x) instead of corr(x, y) / var(x)? I think that me asking this question is revealing the fact that I am still not fully understanding the difference between covariance and correlation.

8 Comments

2025/01/21
18:34 UTC

Ideas for Econometrics Undergrad project?

Hi, I am taking my first introductory Econometrics class, and we have to do a research project up to 12 pages. I am having difficulties with finding a good idea and datasets. I want to keep it simple to work with, but not too simple that it would result in a bad grade, does anyone have suggestions?

5 Comments

2025/01/21
11:12 UTC

Ambiguous question

https://preview.redd.it/poldte9zgbee1.png?width=920&format=png&auto=webp&s=5b78433fd87c1ee1d3cfcb32543564e4f79be109

I selected the "None" option since on the second option it says "Under the null", so I assumed that option was referring to homoskedasticity. What are your views on this?

9 Comments

2025/01/21
09:36 UTC

How can I ensure meanginful results when dealing with a small sample (eg: research on ASAEN, BRICS, etc)

Hi I'm doing my research on a sample of small countries but I've been very worried about the validity of my results. So far I'm getting very weird results but I don't mind going back and reworking my dataset but regardless of what I do my sample will be capped less than 30 so I can't take advantage of CLT assumptions with samples.

I've been scouring STATA and basically everyone just says to stick with FE/RE as there's not much I can do. What if I try to increase my T will that alleviate concerns of power in my model?

What can I do?

16 Comments

2025/01/19
16:51 UTC

Exchange rate model

Hey guys, i am working on a paper that aims to estimate the impact of exchange rate on the prices of exports and imports (BoP) in Egypt. So i am at 4 or more models to apply Stochastic frontier model sfgm Smooth transition regression str GARCH Markov switching Which one to apply and based on what also what is the criteria to choose the model noting that all of them worked on the exchange rate volatility

8 Comments

2025/01/19
09:13 UTC

Help for a project

So my dissertation topic is to find the impact of FDIs, FIIs and some other macroeconomic variables on stock indices of 6 different countries and am thinking of going for DSGE modeling to do so. Is there any way I can learn how to use it in R? And if there are any better alternatives could you please recommend those. I also came across something called hodge decomposition which seems fine but I know only have surface level knowledge on it.

1 Comment

2025/01/18
22:08 UTC

Need help regarding time series analysis.

Hello. I am a beginner to time series. I was trying to do a price forecasting for Cotton crop prices by taking the monthly data of the last 10 years. But the price data is available only for the month of January to may and then the month of November and December. There is no market data for other months as cotton is a seasonal crop here. So in this case how can I proceed with time series analysis and how many minimum datapoints should I have to take to run a model?

2 Comments

2025/01/18
05:49 UTC