<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>Amirkabir University of Technology</PublisherName>
				<JournalTitle>AUT Journal of Electrical Engineering</JournalTitle>
				<Issn>2588-2910</Issn>
				<Volume>53</Volume>
				<Issue>1</Issue>
				<PubDate PubStatus="epublish">
					<Year>2021</Year>
					<Month>06</Month>
					<Day>01</Day>
				</PubDate>
			</Journal>
<ArticleTitle>A Comparison of Two Neural Network Based Methods for Human Activity Recognition</ArticleTitle>
<VernacularTitle></VernacularTitle>
			<FirstPage>17</FirstPage>
			<LastPage>26</LastPage>
			<ELocationID EIdType="pii">4148</ELocationID>
			
<ELocationID EIdType="doi">10.22060/eej.2020.18517.5357</ELocationID>
			
			<Language>EN</Language>
<AuthorList>
<Author>
					<FirstName>Saeedeh</FirstName>
					<LastName>Zebhi</LastName>
<Affiliation>Yazd University</Affiliation>
<Identifier Source="ORCID">0000-0002-2281-7316</Identifier>

</Author>
<Author>
					<FirstName>Seyed Mohammad Taghi</FirstName>
					<LastName>AlModaressi</LastName>
<Affiliation>Yazd University</Affiliation>
<Identifier Source="ORCID">0000-0002-9021-4293</Identifier>

</Author>
<Author>
					<FirstName>Vahid</FirstName>
					<LastName>Abootalebi</LastName>
<Affiliation>Yazd University</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2020</Year>
					<Month>05</Month>
					<Day>31</Day>
				</PubDate>
			</History>
		<Abstract>In this paper, two diﬀerent methods of human activity recognition based on video signals are introduced. The first method explores the eﬀectiveness of combining feature descriptors obtained by local descriptors and artiﬁcial neural network classiﬁer. It is used in the traditional approach and the local descriptors extract interest points or local patches from the videos, and the feature vectors are later constructed based on the intrests, and eventually feature vectors are used as the input of a two-layer feed-forward artiﬁcial neural network (ANN). Experimental results show that using the HOG3D descriptor with ANN gives the best performance. On the other hand, deep learning architectures have attracted much consideration for automatic feature extraction in the last years, so an improved 3D convolutional neural network architecture is also designed as the second method. They are implemented and compared with state-of-the-art approaches on two data sets. The results exhibit that method 1 is superior when the shortage of sample data is the main restriction. It respectively achieves recognition accuracies of 97.8% and 99.8% for the Weizmann and KTH action data sets. In addition, method 2 is considerable for its automatic features extraction, and achieves an acceptable result with lots of original training data. As a result, it gets recognition accuracy of 92% for the KTH data set while this value is drastically reduced for the Weizmann data set.</Abstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Local descriptors</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">artiﬁcial neural network</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">3D convolutional neural network</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">histogram of oriented gradients 3D (HOG3D)</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://eej.aut.ac.ir/article_4148_810dfbbebb17302018ae903e9cb7a483.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
